NUCLEIC ACIDS OF THE HUMAN ABCA5, ABCA6, ABCA9, AND ABCA10
GENES, VECTORS CONTAINING SUCH NUCLEIC ACIDS AND USES
THEREOF

The present invention relates to novel nucleic acids corresponding to the ABCA5,
ABCA6, ABCA9, and ABCA10 genes, and cDNAs encoding novel ABCA5, ABCA6,
ABCA9, and ABCA10 proteins. The invention also relates to means for the detection of
polymorphisms in general, and mutations in particular, in the ABCA5, ABCA6,
ABCA9, and ABCA10 genes or corresponding proteins produced by the allelic forms of
the ABCA5, ABCA6, ABCA9, and ABCA10 genes.

The ABC (ATP-binding cassette) transporter gene superfamily encodes active
transporter proteins and constitutes a family of proteins that is extremely well conserved
during evolution, from bacteria to humans (Ames and Lecar, FASEB J., 1992, 6,
2660-2666). The ABC proteins are involved in extra- and intracellular membrane transport of
various substrates, for example ions, amino acids, peptides, sugars, vitamins or steroid
hormones. Among the 40 characterized human members, 11 members have been
described as associated with human disease, such as inter alia ABCA1, ABCA4
(ABCR) and ABCC7 (CFTR), which are thought to be involved in Tangier disease
(Bodzioch M et al., Nat. Genet., 1999, 22(4), 347-351; Brooks-Wilson et al., Nat.
Genet., 1999, 22(4), 336-345; Rust S et al., Nat. Genet., 1999, 22, 352-355; Remaley A
T et al., ), Stargardt disease (Lewis R A et al., Am. J. Hum. Genet., 1999, 64,
422-434), and cystic fibrosis (Riordan JM et al., Science, 1989, 245, 1066-1073),
respectively. These associations reveal the importance of the functional role of the ABC
gene family, and the discovery of new family gene members should provide new insights
into the physiopathology of human diseases.

The prototype ABC protein binds ATP and uses the energy from ATP hydrolysis
to drive the transport of various molecules across cell membranes. The functional
protein contains two ATP-binding domains (nucleotide binding fold, NBF) and two
transmembrane (TM) domains. The genes are typically organized as full transporters
containing two of each domain, or half transporters with only one of each domain. Most
full transporters are arranged in a TM-NBF-TM-NBF fashion (Dean et al., Curr Opin
Genet, 1995, 5, 79-785).
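By way of illustration only, the distinction between full and half transporters can be sketched as a count over an ordered list of domain labels. The function name `classify_transporter` and the label strings "TM" and "NBF" are hypothetical conventions chosen for this sketch; it assumes the domains of a protein have already been annotated by some other means.

```python
def classify_transporter(domains):
    """Classify an ABC protein from its ordered list of domain labels.

    `domains` is a list such as ["TM", "NBF", "TM", "NBF"].
    Two of each domain -> full transporter; one of each -> half transporter.
    """
    tm_count = domains.count("TM")
    nbf_count = domains.count("NBF")
    if tm_count == 2 and nbf_count == 2:
        return "full transporter"
    if tm_count == 1 and nbf_count == 1:
        return "half transporter"
    return "unclassified"


# The typical full-transporter arrangement described above.
print(classify_transporter(["TM", "NBF", "TM", "NBF"]))
# A half transporter carries only one domain of each kind.
print(classify_transporter(["TM", "NBF"]))
```

The sketch deliberately ignores domain order, since the text states only that most, not all, full transporters follow the TM-NBF-TM-NBF arrangement.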
Analysis of amino acid sequence alignments of the ATP-binding domains has
allowed the ABC genes to be separated into sub-families (Allikmets et al., Hum Mol
Genet, 1996, 5, 1649-1655). Currently, according to the recent HUGO classification,
seven ABC gene sub-families named ABC (A to G) have been described in the human
genome (ABC1, CFTR/MRP, MDR, ABC8, ALD, GCN20, OABP), with all except one
(OABP) containing multiple members. For the most part these sub-families contain
genes that also display considerable conservation in the transmembrane domain
sequences and have similar gene organization. However, ABC proteins transport a
wide variety of substrates, and some members of different sub-families have been shown to
share more similarity in substrate recognition than do proteins within the same sub-family.
Five of the sub-families are also represented in the yeast genome, indicating that these
groups have been established and retained early in the evolution of eukaryotes (Decottignies et al.,
Nat Genet, 1997, 137-45; Michaelis et al., 1995, Cold Spring Harbor Laboratory Press).
Several ABC transport proteins that have been identified in humans are associated with
various diseases. For example, cystic fibrosis is caused by mutations in the CFTR
(cystic fibrosis transmembrane conductance regulator) gene (Riordan JM et al., Science,
1989, 245, 1066-1073). Moreover, some multiple drug resistance phenotypes in tumor
cells have been associated with the gene encoding the MDR (multi-drug resistance)
protein, which also has an ABC transporter structure (Anticancer Drug Des. 1999
Apr;14(2):115-31). Other ABC transporters have been associated with neuronal and
tumor conditions (US Patent No. 5,858,719) or are potentially involved in diseases caused
by impairment of the homeostasis of metals (Biochim Biophys Acta. 1999 Dec
6;1461(2):18-404). Likewise, another ABC transporter, designated PFIC2, appears to be
involved in a form of progressive familial intrahepatic cholestasis, this protein being
potentially responsible, in humans, for the export of bile salts (Strautnieks S S et al., Nat
Genet, 1998, 20, 233-238).

Among the ABC sub-families, the ABCA gene subfamily is probably the most
evolutionarily complex. The ABCA genes and OABP represent the only two sub-families of
ABC genes that do not have identifiable orthologs in the yeast genome (Decottignies and
Goffeau, 1997; Michaelis and Berkower, 1995). There is, however, at least one
ABCA-related gene in C. elegans (ced-7) and several in Drosophila. Thus the ABCA genes
appear to have diverged after eukaryotes became multicellular and developed more
sophisticated transport requirements. To date eleven members of the human ABCA
sub-family have been described, making it the largest such group.

Full sequences of four genes of the ABCA sub-family have been described,
revealing a complex exon-intron structure. The best characterized ABCA genes are ABCA4
and ABCA1. In mammals the ABCA1 gene is highly expressed in macrophages and
monocytes and is associated with the engulfment of apoptotic cells (Luciani et al.,
Genomics (1994) 21, 150-9; Moynault et al., Biochem Soc Trans (1998) 26, 629-35; Wu
et al., Cell (1998) 93, 951-60). The ced-7 gene, ortholog of ABCA1 in C. elegans, plays
a role in the recognition and engulfment of apoptotic cells, suggesting a conserved
function. Recently ABCA1 was demonstrated to be the gene responsible for Tangier
disease, a disorder characterized by high levels of cholesterol in peripheral tissues and a
very low level of HDLs, and for familial hypoalphalipoproteinemia (FHD) (Bodzioch et al.,
Nat Genet (1999) 22, 347-51; Brooks-Wilson et al., Nat Genet (1999) 336-45; Rust et
al., Nat Genet (1999) 22, 352-5; Marcil et al., The Lancet (1999) 354, 1341-46). The
ABCA1 protein is proposed to function in the reverse transport of cholesterol from
peripheral tissues via an interaction with the apolipoprotein A-1 (ApoA-1) of HDL
(Wang et al., JBC (2000)). The ABCA2 gene is highly expressed in the brain and
ABCA3 in the lung, but no function has been ascribed to these loci. The ABCA4 gene is
exclusively expressed in the rod photoreceptors of the retina, and mutations thereof are
responsible for several pathologies of human eyes, such as retinal degenerative disorders
(Allikmets et al., Science (1997) 277, 1805-1807; Allikmets et al., Nat Genet
(1997) 15, 236-246; Sun et al., J Biol Chem (1999) 8269-81; Weng et al., Cell (1999)
98, 13-23; Cremers et al., Hum Mol Genet (1998) 7, 355-362; Martinez-Mir et al.,
Genomics (1997) 40, 142-146). ABCA4 is believed to transport retinal and/or
retinal-phospholipid complexes from the rod photoreceptor outer segment disks to the
cytoplasm, facilitating phototransduction.

Therefore, characterization of new genes from the ABCA subfamily is likely to yield
biologically important transporters that may have a translocase activity for membrane
lipid transport and may play a major role in human pathologies.

Lipids are water-insoluble organic biomolecules, which are essential components of
diverse biological functions, including the storage, transport, and metabolism of energy,
and membrane structure and fluidity. Lipids are derived from two sources in humans and
other animals: some lipids are ingested as dietary fats and oils and other lipids are
biosynthesized by the human or animal. In mammals at least 10% of the body weight is
lipid, the bulk of which is in the form of triacylglycerols.

Triacylglycerols, also known as triglycerides and triacylglycerides, are made up of
three fatty acids esterified to glycerol. Dietary triacylglycerols are stored in adipose tissues
as a source of energy, or hydrolyzed in the digestive tract by triacylglycerol lipases, the
most important of which is pancreatic lipase. Triacylglycerols are transported between
tissues in the form of lipoproteins.

Lipoproteins are micelle-like assemblies found in plasma and contain varying
proportions of different types of lipids and proteins (called apoproteins). There are five
main classes of plasma lipoproteins, the major function of which is lipid transport. These
classes are, in order of increasing density, chylomicrons, very low density lipoproteins
(VLDL), intermediate-density lipoproteins (IDL), low density lipoproteins (LDL), and high
density lipoproteins (HDL). Although many types of lipids are found associated with each
lipoprotein class, each class transports predominantly one type of lipid: triacylglycerols are
transported in chylomicrons, VLDL, and IDL, while phospholipids and cholesterol esters
are transported in HDL and LDL, respectively.

Phospholipids are di-fatty acid esters of glycerol phosphate, also containing a polar
group coupled to the phosphate. Phospholipids are important structural components of
cellular membranes. Phospholipids are hydrolyzed by enzymes called phospholipases.
Phosphatidylcholine, an exemplary phospholipid, is a major component of most eukaryotic
cell membranes.

Cholesterol is the metabolic precursor of steroid hormones and bile acids as well as
an essential constituent of cell membranes. In humans and other animals, cholesterol is
ingested in the diet and also synthesized by the liver and other tissues. Cholesterol is
transported between tissues in the form of cholesteryl esters in LDLs and other
lipoproteins.

Membranes surround every living cell, and serve as a barrier between the
intracellular and extracellular compartments. Membranes also enclose the eukaryotic
nucleus, make up the endoplasmic reticulum, and serve specialized functions such as in the
myelin sheath that surrounds axons. A typical membrane contains about 40% lipid and
60% protein, but there is considerable variation. The major lipid components are
phospholipids, specifically phosphatidylcholine and phosphatidylethanolamine, and
cholesterol. The physicochemical properties of membranes, such as fluidity, can be
changed by modification of either the fatty acid profiles of the phospholipids or the
cholesterol content. Modulating the composition and organization of membrane lipids also
modulates membrane-dependent cellular functions, such as receptor activity, endocytosis,
and cholesterol flux.

High-density lipoproteins (HDL) are one of the five major classes of lipoproteins
circulating in blood plasma. These lipoproteins are involved in various metabolic pathways
such as lipid transport, the formation of bile acids, steroidogenesis, cell proliferation and,
in addition, interfere with the plasma proteinase systems.

HDLs are perfect free cholesterol acceptors and, in combination with enzymatic
activities such as those of the cholesterol ester transfer protein (CETP), the lipoprotein lipase
(LPL), the hepatic lipase (HL) and the lecithin:cholesterol acyltransferase (LCAT), play a
major role in the reverse transport of cholesterol, that is to say the transport of excess
cholesterol in the peripheral cells to the liver for its elimination from the body in the form
of bile acid. It has been demonstrated that the HDLs play a central role in the transport of
cholesterol from the peripheral tissues to the liver.

Various diseases linked to an HDL deficiency have been described, including
Tangier disease, FHD, and LCAT deficiency. In addition, HDL-cholesterol deficiencies
have been observed in patients suffering from malaria and diabetes (Kittl et al., 1992,
Wien Klin Wochenschr., 104:21-4; Nilsson et al., 1990, J. Intern. Med., 227:151-5;
Djoumessi, 1989, Pathol Biol., 37:909-11; Mohanty et al., 1992, Ann Trop Med
Parasitol., 86:601-6; Maurois et al., 1985, Biochimie, 67:227-39; Grellier et al., 1997,
Vox Sang., 72:211-20; Agbedana et al., 1990, Ann Trop Med Parasitol., 84:529-30;
Erel et al., 1998, Haematologia, Budap, 29:207-12; Cuisinier et al., 1990, Med Trop.,
50:91-5; Chander et al., 1998, Indian J Exp Biol., 36:371-4; Efthimiou et al., 1992,
Wien Klin Wochenschr., 104:705-6; Baptista et al., 1996, Parasite, 3:335-40; Davis et
al., 1993, J. Infect., 26:279-85; Davis et al., 1995, J. Infect., 31:181-8; Pirich et al., 1993,
Semin Thromb Hemost., 19:138-43; Tomlinson and Raper, 1996, Nat. Biotechnol.,
14:717-21; Hager and Hajduk, 1997, Nature, 385:823-6; Kwiterovich, 1995, Ann NY
Acad Sci., 748:313-30; Syvanne et al., 1995, Circulation, 92:364-70; and Syvanne et
al., 1995, J. Lipid Res., 36:573-82). The deficiency involved in Tangier disease and/or FHD
is linked to a cellular defect in the translocation of cellular cholesterol, which
causes a degradation of the HDLs and leads to a disruption in the lipoprotein
metabolism.
Atherosclerosis is defined in histological terms by deposits (lipid or fibrolipid
plaques) of lipids and of other blood derivatives in blood vessel walls, especially the large
arteries (aorta, coronary arteries, carotid). These plaques, which are more or less calcified
according to the degree of progression of the atherosclerosis process, may be coupled with
lesions and are associated with the accumulation in the vessels of fatty deposits consisting
essentially of cholesterol esters. These plaques are accompanied by a thickening of the
vessel wall, hypertrophy of the smooth muscle, appearance of foam cells (lipid-laden cells
resulting from uncontrolled uptake of cholesterol by recruited macrophages) and
accumulation of fibrous tissue. The atheromatous plaque protrudes markedly from the
wall, endowing it with a stenosing character responsible for vascular occlusions by
atheroma, thrombosis or embolism, which occur in those patients who are most affected.
These lesions can lead to serious cardiovascular pathologies such as myocardial infarction,
sudden death, cardiac insufficiency, and stroke.

Mutations within genes that play a role in lipoprotein metabolism have been
identified. Specifically, several mutations in the apolipoprotein apoA-I gene have been
characterized. These mutations are rare and may lead to a lack of production of apoA-I.
Mutations in the genes encoding LPL or its activator apoC-II are associated with severe
hypertriglyceridemias and substantially reduced HDL-C levels. Mutations in the gene
encoding the enzyme LCAT are also associated with a severe HDL deficiency.

In addition, dysfunctions in the reverse transport of cholesterol may be induced by
physiological deficiencies affecting one or more of the steps in the transport of stored
cholesterol, from the intracellular vesicles to the membrane surface where it is accepted by
the HDLs.

Therefore, an increasing need exists in the state of the art to identify genes involved
in any of the steps in the metabolism of cholesterol and/or lipoproteins, and in particular,
genes associated with dysfunctions in the reverse transport of cholesterol from the
peripheral cells to the liver.

The applicants have discovered and characterized a gene cluster containing four new
genes belonging to the ABCA protein sub-family, which have been designated ABCA5,
ABCA6, ABCA9, and ABCA10. These new genes appear to be closely related to other
ABCA subfamily members such as ABCA1 and ABCA8, particularly in the ATP-binding
domains, and more particularly in the C-terminal ATP-binding domain. The
newly discovered genes also show considerable conservation of the amino acid
sequences, particularly within the transmembrane region (TM) and the ATP-binding
regions (NBD), and have a similar gene organization.

Surprisingly, the applicants have found these genes to be organized in a single
large cluster on chromosome 17q24, in a head-to-tail fashion, with a similar intron/exon
organization, suggesting that they have arisen from tandem duplication and may form a
distinct functional group within the ABCA subfamily.

Furthermore, each of the discovered genes is transcribed with a tissue-specific
distribution, but presents a heterogeneous pattern of expression, suggesting a regional
and probably functional specialization of the corresponding proteins.


SUMMARY OF THE INVENTION 

The present invention relates to nucleic acids corresponding to the various human
ABCA5, ABCA6, ABCA9, and ABCA10 genes, which are likely to be involved in the
reverse transport of cholesterol, as well as in the membrane transport of lipophilic
molecules, in particular inflammation mediating substances such as prostaglandins and
prostacyclins, or in any pathology whose candidate chromosomal region is situated on
chromosome 17, more precisely on the 17q arm and still more precisely in the 17q24 locus.

Thus, a first subject of the invention is a nucleic acid comprising a nucleotide
sequence of any one of SEQ ID NOs: 1-4, and 9-126, or a complementary nucleotide
sequence thereof.
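For illustration, a complementary nucleotide sequence can be derived from a given DNA strand by Watson-Crick base pairing, read in the opposite orientation. The following is a minimal sketch, assuming unambiguous DNA bases (A, C, G, T) only; the function name `reverse_complement` is a hypothetical helper, and the example sequence is arbitrary and is not taken from any SEQ ID NO.

```python
# Translation table for Watson-Crick pairing of unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the complementary strand of `seq`, written 5' to 3'."""
    # Complement every base, then reverse so the result reads 5' -> 3'.
    return seq.translate(_COMPLEMENT)[::-1]


# Arbitrary example sequence, not part of the sequence listing.
print(reverse_complement("ATGC"))  # GCAT
```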

The invention also relates to a nucleic acid comprising at least 8 consecutive
nucleotides of a nucleotide sequence of any one of SEQ ID NOs: 1-4, and 9-126, or a
complementary nucleotide sequence thereof.
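As a hedged illustration of the "at least 8 consecutive nucleotides" criterion, the sketch below tests whether a candidate sequence shares a run of k consecutive nucleotides with a reference sequence or with its complementary strand. The helper names and the example sequences are assumptions introduced for this sketch; only exact, ungapped matches of unambiguous DNA bases are considered.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def _kmers(seq, k):
    """Return the set of all length-k substrings of `seq`."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shares_consecutive_run(candidate, reference, k=8):
    """True if `candidate` contains k consecutive nucleotides of `reference`
    or of the reverse complement of `reference`."""
    candidate = candidate.upper()
    reference = reference.upper()
    rev_comp = reference.translate(_COMPLEMENT)[::-1]
    # Collect the k-mers of both strands of the reference.
    targets = _kmers(reference, k) | _kmers(rev_comp, k)
    return not targets.isdisjoint(_kmers(candidate, k))


# Arbitrary sequences for demonstration; they share the run TACGTTTG.
reference = "ACGTACGTTTGACCA"
print(shares_consecutive_run("GGGTACGTTTGGG", reference))
print(shares_consecutive_run("AAAAAAAAAA", reference))
```

A set of k-mers is used so that each comparison is a constant-time lookup, which matters when the reference is a full-length cDNA.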

The invention also relates to a nucleic acid having at least 80% nucleotide identity
with a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4, and
9-126, or a complementary nucleotide sequence thereof.

The invention also relates to a nucleic acid having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% nucleotide identity with a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof.
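The identity thresholds recited above can be illustrated with a simple calculation. The sketch below assumes that the two sequences have already been optimally aligned by some external method and are supplied as equal-length strings in which "-" marks a gap; it counts identical positions over the aligned columns. This is one common convention and is an assumption of the sketch, not necessarily the comparison procedure used elsewhere in this specification. The function name `percent_identity` and the example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a gap opposite
    a nucleotide counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Arbitrary 10-nucleotide example with a single mismatch: 9 of 10 identical.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAT")
for threshold in (80, 85, 90, 95, 98):
    print(threshold, identity >= threshold)
```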

The invention also relates to a nucleic acid hybridizing, under high stringency 
conditions, with a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a 
complementary nucleotide sequence thereof. 
The invention also relates to nucleic acids, particularly cDNA molecules, which
encode the full length human ABCA5, ABCA6, ABCA9, and ABCA10 proteins. Thus, the
invention relates to a nucleic acid comprising a nucleotide sequence of any one of
SEQ ID NOs: 1-4, or of a complementary nucleotide sequence.

The invention also relates to a nucleic acid comprising a nucleotide sequence as
depicted in SEQ ID NOs: 1-4, or a complementary nucleotide sequence.

According to the invention, a nucleic acid comprising a nucleotide sequence of
SEQ ID NO: 1 encodes a full length ABCA5 polypeptide of 1642 amino acids
comprising the amino acid sequence of SEQ ID NO: 5.

According to the invention, a nucleic acid comprising a nucleotide sequence of
SEQ ID NO: 2 encodes a full length ABCA6 polypeptide of 1617 amino acids
comprising the amino acid sequence of SEQ ID NO: 6.

According to the invention, a nucleic acid comprising a nucleotide sequence of
SEQ ID NO: 3 encodes a full length ABCA9 polypeptide of 1624 amino acids
comprising the amino acid sequence of SEQ ID NO: 7.

According to the invention, a nucleic acid comprising a nucleotide sequence of
SEQ ID NO: 4 encodes a full length ABCA10 polypeptide of 1543 amino acids
comprising the amino acid sequence of SEQ ID NO: 8.
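As a worked illustration of the polypeptide lengths given above, the minimum length of a coding region follows from the triplet genetic code: three nucleotides per amino acid, plus one stop codon. The sketch below computes this lower bound. The resulting nucleotide counts are derived arithmetic only and are not stated in this specification; the actual cDNAs of SEQ ID NOs: 1-4 may be longer, for example because of untranslated regions. The names used are hypothetical.

```python
# Full length polypeptide sizes (amino acids) as stated in the text.
FULL_LENGTH_AA = {
    "ABCA5": 1642,
    "ABCA6": 1617,
    "ABCA9": 1624,
    "ABCA10": 1543,
}


def min_coding_length_nt(n_amino_acids, include_stop=True):
    """Minimum coding-region length in nucleotides for a polypeptide."""
    codons = n_amino_acids + (1 if include_stop else 0)
    return 3 * codons


for gene, n_aa in FULL_LENGTH_AA.items():
    # Lower bound only; untranslated regions are not accounted for.
    print(gene, n_aa, min_coding_length_nt(n_aa))
```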

Thus, the invention also relates to a nucleic acid encoding a polypeptide comprising
an amino acid sequence of any one of SEQ ID NOs: 5-8.

Thus, the invention also relates to a polypeptide comprising an amino acid sequence
of any one of SEQ ID NOs: 5-8.

The invention also relates to a polypeptide comprising an amino acid sequence as 
depicted in any one of SEQ ID NO: 5-8. 
The invention also relates to a means for detecting polymorphisms in general, and
mutations in particular, in the ABCA5, ABCA6, ABCA9, and ABCA10 genes or in the
corresponding proteins produced by the allelic form of these genes.

According to another aspect, the invention also relates to the nucleotide sequences
of ABCA5, ABCA6, ABCA9, and ABCA10 genes comprising at least one biallelic
polymorphism such as, for example, a substitution, addition or deletion of one or more
nucleotides.

The invention also relates to nucleotide probes and primers hybridizing with a nucleic
acid sequence located in the region of any one of the ABCA5, ABCA6, ABCA9, and ABCA10
nucleic acids (genomic DNA, messenger RNA, cDNA), in particular, a nucleic acid sequence
comprising any one of the mutations or polymorphisms.

The nucleotide probes or primers according to the invention comprise at least 8
consecutive nucleotides of a nucleic acid comprising any one of SEQ ID NOs: 1-4 and
9-126, or a complementary nucleotide sequence thereof.

Preferably, nucleotide probes or primers according to the invention will have a
length of 10, 12, 15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100, 200, 500, 1000, 1500
consecutive nucleotides of a nucleic acid according to the invention, in particular of a
nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or a complementary
nucleotide sequence thereof.

Alternatively, a nucleotide probe or primer according to the invention will consist
of and/or comprise fragments having a length of 12, 15, 18, 20, 25, 35, 40, 50, 100, 200,
500, 1000, 1500 consecutive nucleotides of a nucleic acid according to the invention, more
particularly of a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof.

The definition of a nucleotide probe or primer according to the invention therefore
covers oligonucleotides which hybridize, under the high stringency hybridization
conditions defined above, with a nucleic acid comprising any one of SEQ ID NOs: 1-4 and
9-126, or a complementary nucleotide sequence thereof.

The preferred probes and primers according to the invention comprise all or part of
a nucleotide sequence comprising any one of SEQ ID NOs: 127-217, or a complementary
nucleotide sequence thereof.

The nucleotide primers according to the invention may be used to amplify any one
of the nucleic acids according to the invention, and more particularly a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof.

According to the invention, some nucleotide primers specific for an ABCA5 gene
may be used to amplify a nucleic acid comprising SEQ ID NO: 1, and comprise a
nucleotide sequence of any one of SEQ ID NOs: 127-150, or a complementary nucleotide
sequence thereof.

The invention also relates to nucleotide primers specific for an ABCA6 gene, which
may be used to amplify a nucleic acid comprising any one of SEQ ID NOs: 2 and 9-47, and
which comprise a nucleotide sequence of any one of SEQ ID NOs: 151-177, or a complementary
nucleotide sequence.

The invention is further directed to nucleotide primers specific for an ABCA9 gene, which
may be used to amplify a nucleic acid comprising any one of SEQ ID NOs: 3 and 48-86,
and which comprise a nucleotide sequence of any one of SEQ ID NOs: 178-209, or a
complementary nucleotide sequence.

The present invention is further directed to nucleotide primers specific for an
ABCA10 gene, which may be used to amplify a nucleic acid comprising any one of SEQ ID NOs:
4 and 87-126, and which comprise a nucleotide sequence of any one of SEQ ID NOs: 210-217, or
a complementary nucleotide sequence.

Another subject of the invention relates to a method of amplifying a nucleic acid
according to the invention, and more particularly a nucleic acid comprising a) any one of
SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence thereof, or b) as
depicted in any one of SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide
sequence thereof, contained in a sample, said method comprising the steps of:

a) bringing the sample in which the presence of the target nucleic acid is suspected
into contact with a pair of nucleotide primers whose hybridization position is located
respectively on the 5' side and on the 3' side of the region of the target nucleic acid whose
amplification is sought, in the presence of the reagents necessary for the amplification
reaction; and

b) detecting the amplified nucleic acids.
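The geometry of the primer pair in step a) can be illustrated computationally. The sketch below is a simplified in silico model, assuming exact primer matches on a single-stranded template written 5' to 3': the forward primer matches the template on the 5' side, and the reverse primer is the reverse complement of the template on the 3' side. The function names and the example sequences are hypothetical and are not taken from SEQ ID NOs: 127-217; real amplification also depends on reaction conditions that are not modelled here.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand of `seq`, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def in_silico_amplicon(template, forward, reverse):
    """Return the region of `template` bounded by the two primers, or None.

    `forward` must match the template strand; `reverse` must match the
    opposite strand downstream of the forward primer site.
    """
    template = template.upper()
    forward = forward.upper()
    start = template.find(forward)
    if start == -1:
        return None
    # Site on the template strand where the reverse primer hybridizes.
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Arbitrary demonstration template assembled from labelled parts.
template = "GGG" + "ATGCCA" + "TTTGA" + "CCGTAA" + "ACC"
amplicon = in_silico_amplicon(template, "ATGCCA", "TTACGG")
print(amplicon)  # ATGCCATTTGACCGTAA
```

A return value of None corresponds to a sample in which no product would be expected under this simplified model, for example because one primer site is absent.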

The present invention also relates to a method of detecting the presence of a nucleic
acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence, or a nucleic acid fragment or variant of any one of
SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence in a sample, said
method comprising the steps of:

1) bringing one or more nucleotide probes according to the invention into contact
with the sample to be tested; and

2) detecting the complex which may have formed between the probe(s) and the
nucleic acid present in the sample.

According to a specific embodiment of the method of detection according to the
invention, the oligonucleotide probes are immobilized on a support.

According to another aspect, the oligonucleotide probes comprise a detectable
marker.

Another subject of the invention is a box or kit for amplifying all or part of a
nucleic acid comprising a) any one of SEQ ID NOs: 1-4 and 9-126, or a complementary
nucleotide sequence thereof, or b) as depicted in any one of SEQ ID NOs: 1-4 and 9-126 or
of a complementary nucleotide sequence thereof, said box or kit comprising:

1) a pair of nucleotide primers in accordance with the invention, whose
hybridization position is located respectively on the 5' side and 3' side of the target nucleic
acid whose amplification is sought; and optionally,

2) reagents necessary for an amplification reaction.

Such an amplification box or kit will preferably comprise at least one pair of 
nucleotide primers as described above. 

The invention also relates to a box or kit for detecting the presence of a nucleic acid
according to the invention in a sample, said box or kit comprising:

a) one or more nucleotide probes according to the invention; and

b) appropriate reagents necessary for a hybridisation reaction.

According to a first aspect, the detection box or kit is characterized in that the
nucleotide probe(s) and primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in that the
nucleotide probe(s) and primer(s) comprise a detectable marker.

According to a specific embodiment of the detection kit described above, such a kit
will comprise a plurality of oligonucleotide probes and/or primers in accordance with the
invention which may be used to detect target nucleic acids of interest or alternatively to
detect mutations in the coding regions or the non-coding regions of the nucleic acids
according to the invention. According to a preferred embodiment of the invention, the target
nucleic acid comprises a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or
of a complementary nucleic acid sequence. Alternatively, the target nucleic acid is a
nucleic acid fragment or variant of a nucleic acid comprising any one of SEQ ID NOs: 1-4
and 9-126, or of a complementary nucleotide sequence.

According to another preferred embodiment, a primer according to the invention
comprises, generally, all or part of any one of SEQ ID NOs: 1-4, and 9-217, or a
complementary sequence.
The invention also relates to a recombinant vector comprising a nucleic acid
according to the invention. Preferably, such a recombinant vector will comprise a nucleic
acid selected from the group consisting of:

a) a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4
and 9-126, or a complementary nucleotide sequence thereof;

b) a nucleic acid comprising a nucleotide sequence as depicted in any one of SEQ
ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence thereof;

c) a nucleic acid having at least eight consecutive nucleotides of a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof;

d) a nucleic acid having at least 80% nucleotide identity with a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof;

e) a nucleic acid having 85%, 90%, 95%, or 98% nucleotide identity with a nucleic
acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence thereof;

f) a nucleic acid hybridizing, under high stringency hybridization conditions, with a
nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126,
or a complementary nucleotide sequence; and

g) a nucleic acid encoding a polypeptide comprising an amino acid sequence of
SEQ ID NOs: 5-8.

According to a first embodiment, a recombinant vector according to the invention is
used to amplify a nucleic acid inserted therein, following transformation or transfection of
a desired cellular host.

According to a second embodiment, a recombinant vector according to the
invention corresponds to an expression vector comprising, in addition to a nucleic acid in
accordance with the invention, a regulatory signal or nucleotide sequence that directs or
controls transcription and/or translation of the nucleic acid and its encoded mRNA.

According to a preferred embodiment, a recombinant vector according to the
invention will comprise in particular the following components:

(1) an element or signal for regulating the expression of the nucleic acid to be
inserted, such as a promoter and/or enhancer sequence;

(2) a nucleotide coding region comprised within the nucleic acid in accordance with
the invention to be inserted into such a vector, said coding region being placed in phase
with the regulatory element or signal described in (1); and

(3) an appropriate nucleic acid for initiation and termination of transcription of the
nucleotide coding region of the nucleic acid described in (2).

The present invention also relates to a defective recombinant virus
comprising a cDNA nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides involved in the transport of lipophilic substances, in particular
mediators of inflammation, or in any pathology whose candidate chromosomal region is
situated on chromosome 17, more precisely on the 17q arm and still more precisely in the
17q24 locus.

In another preferred embodiment of the invention, the defective recombinant virus
comprises a gDNA nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides involved in the transport of lipophilic substances, in particular inflammatory
lipophilic substances. Preferably, the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides
comprise amino acid sequences selected from SEQ ID NOs: 5-8, respectively.

In another preferred embodiment, the invention relates to a defective recombinant
virus comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides involved in the transport of inflammatory lipophilic substances, under the
control of a promoter chosen from RSV-LTR or the CMV early promoter.

According to a specific embodiment, a method of introducing a nucleic acid
according to the invention into a host cell, in particular a host cell obtained from a
mammal, in vivo, comprises a step during which a preparation comprising a
pharmaceutically compatible vector and a "naked" nucleic acid according to the invention,
placed under the control of appropriate regulatory sequences, is introduced by local
injection at the level of the chosen tissue, for example a smooth muscle tissue, the "naked"
nucleic acid being absorbed by the cells of this tissue.

According to a specific embodiment of the invention, a composition is provided for
the in vivo production of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins.
This composition comprises a nucleic acid encoding the ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides placed under the control of appropriate regulatory sequences, in
solution in a physiologically acceptable vehicle and/or excipient.

Therefore, the present invention also relates to a composition comprising a nucleic
acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides comprising
an amino acid sequence selected from SEQ ID NO: 5-8, wherein the nucleic acid is placed
under the control of appropriate regulatory elements.

Consequently, the invention also relates to a pharmaceutical composition intended
for the prevention of or treatment of a patient or subject affected by a dysfunction in the
reverse transport of cholesterol or in the transport of inflammatory lipophilic substances
comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
proteins, in combination with one or more physiologically compatible excipients.

Preferably, such a composition will comprise a nucleic acid comprising a nucleotide
sequence of any one of SEQ ID NO: 1-4 and 9-126, wherein the nucleic acid is placed
under the control of an appropriate regulatory element or signal.

In addition, the present invention is directed to a pharmaceutical composition
intended for the prevention of or treatment of a patient or a subject affected by a
dysfunction in the reverse transport of cholesterol or in the transport of lipophilic
substances mediating inflammation, comprising a recombinant vector according to the
invention, in combination with one or more physiologically compatible excipients.

The invention also relates to the use of a nucleic acid according to the invention
encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins for the
manufacture of a medicament intended for the prevention of arteriosclerosis in various
forms or more particularly for the treatment of subjects affected by a dysfunction of
cholesterol reverse transport or transport of lipophilic substances mediating inflammation.

The invention also relates to the use of a recombinant vector according to the
invention comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and
ABCA10 proteins for the manufacture of a medicament intended for the prevention of
arteriosclerosis in various forms or more particularly for the treatment of subjects affected
by a dysfunction of cholesterol reverse transport or transport of lipophilic substances
mediating inflammation.

The subject of the invention is therefore also a recombinant vector
comprising a nucleic acid according to the invention that encodes any one of ABCA5,
ABCA6, ABCA9 and ABCA10 proteins or polypeptides involved in the metabolism of
cholesterol or transport of lipophilic substances mediating inflammation.

The invention also relates to the use of such a recombinant vector for the
preparation of a pharmaceutical composition intended for the treatment and/or for the
prevention of diseases or conditions associated with deficiency of lipophilic substances
signaling inflammation, or deficiency in the cholesterol reverse transport, or transport of
inflammatory lipophilic substances.

The present invention also relates to the use of cells genetically modified ex vivo
with such a recombinant vector according to the invention, or cells producing a
recombinant vector, wherein the cells are implanted in the body, to allow a prolonged and
effective expression in vivo of any one of the biologically active ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides.

The invention also relates to the use of a nucleic acid according to the invention
encoding any one of ABCA5, ABCA6, ABCA9 and ABCA10 proteins for the manufacture
of a medicament intended for the prevention and/or the treatment of subjects affected by a
dysfunction of cholesterol reverse transport or inflammatory lipophilic substances
transport.

The invention also relates to the use of a recombinant vector according to the
invention comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9 and
ABCA10 polypeptides according to the invention for the manufacture of a medicament
intended for the prevention and/or the treatment of subjects affected by a dysfunction of the
reverse transport of cholesterol or inflammatory lipophilic substances transport.

The invention also relates to the use of a recombinant host cell according to the
invention, comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9 and
ABCA10 polypeptides according to the invention for the manufacture of a medicament
intended for the prevention and/or the treatment of subjects affected by a dysfunction of
cholesterol reverse transport.

The present invention also relates to the use of a recombinant vector according to
the invention, preferably a defective recombinant virus, for the preparation of a
pharmaceutical composition for the treatment and/or prevention of pathologies linked to
the dysfunction of cholesterol reverse transport or inflammatory lipophilic substances
transport.

The invention relates to the use of such a recombinant vector or defective
recombinant virus for the preparation of a pharmaceutical composition intended for the
treatment and/or for the prevention of cardiovascular disease linked to a deficiency in the
reverse transport of cholesterol. Thus, the present invention also relates to a pharmaceutical
composition comprising one or more recombinant vectors or defective recombinant viruses
according to the invention.

The present invention also relates to the use of cells genetically modified ex vivo
with a virus according to the invention, or of cells producing such viruses, implanted in the
body, allowing a prolonged and effective expression in vivo of any one of the biologically
active ABCA5, ABCA6, ABCA9 and ABCA10 proteins.

The present invention shows that it is possible to incorporate a nucleic acid
encoding any one of ABCA5, ABCA6, ABCA9 and ABCA10 polypeptides according to
the invention into a viral vector, and that these vectors make it possible to effectively
express a biologically active, mature polypeptide. More particularly, the invention shows
that the in vivo expression of any one of ABCA5, ABCA6, ABCA9 and ABCA10 proteins
may be obtained by direct administration of an adenovirus or by implantation of a
producing cell or of a cell genetically modified by an adenovirus or by a retrovirus
incorporating such a nucleic acid.

In this regard, another subject of the invention relates to any mammalian cell
infected with one or more defective recombinant viruses according to the invention. More
particularly, the invention relates to any population of human cells infected with these
viruses. These may be in particular cells of blood origin (totipotent stem cells or
precursors), fibroblasts, myoblasts, hepatocytes, keratinocytes, smooth muscle and
endothelial cells, glial cells and the like.

Another subject of the invention relates to an implant comprising mammalian 
cells infected with one or more defective recombinant viruses according to the invention or 
cells producing recombinant viruses, and an extracellular matrix. Preferably, the implants
according to the invention comprise 10^ to 10^0 cells. More preferably, they comprise 10^
to 10^8 cells.

More particularly, in the implants of the invention, the extracellular matrix 
comprises a gelling compound and optionally, a support allowing the anchorage of the 
cells. 

The invention also relates to a recombinant host cell comprising a nucleic acid of
the invention, and more particularly, a nucleic acid comprising any one of SEQ ID NO: 1-4
and 9-126, or of a complementary nucleotide sequence.

The invention also relates to a recombinant host cell comprising a nucleic acid of
the invention, and more particularly a nucleic acid comprising a nucleotide sequence as
depicted in any one of SEQ ID NO: 1-4 and 9-126, or of a complementary nucleotide
sequence.

According to another aspect, the invention also relates to a recombinant host cell
comprising a recombinant vector according to the invention. Therefore, the invention also
relates to a recombinant host cell comprising a recombinant vector comprising any of the
nucleic acids of the invention, and more particularly a nucleic acid comprising any one
nucleotide sequence of SEQ ID NO: 1-4 and 9-126, or a complementary nucleotide
sequence thereof.

Specifically, the invention relates to a recombinant host cell comprising a 
recombinant vector comprising a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 
9-126, or a complementary nucleotide sequence thereof.

The invention also relates to a recombinant host cell comprising a recombinant
vector comprising a nucleic acid comprising a nucleotide sequence as depicted in any one
of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence thereof.

The invention also relates to a recombinant host cell comprising a recombinant
vector comprising a nucleic acid encoding a polypeptide comprising any one amino acid
sequence of SEQ ID NO: 5-8.

The invention also relates to a method for the production of a polypeptide
comprising an amino acid sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a
peptide fragment or a variant thereof, said method comprising the steps of:

a) inserting a nucleic acid encoding said polypeptide into an appropriate vector;

b) culturing, in an appropriate culture medium, a previously transformed host cell or
transfecting a host cell with the recombinant vector of step a);

c) recovering the conditioned culture medium or lysing the host cell, for example by
sonication or by osmotic shock;

d) separating and purifying said polypeptide from said culture medium or
alternatively from the cell lysates obtained in step c); and

e) where appropriate, characterizing the recombinant polypeptide produced.

A polypeptide termed "homologous" to a polypeptide having an amino acid
sequence selected from SEQ ID NO: 5-8 also forms part of the invention. Such a
homologous polypeptide comprises an amino acid sequence possessing one or more
substitutions of an amino acid by an equivalent amino acid.

The ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides according to the invention,
in particular 1) a polypeptide comprising an amino acid sequence of any one of SEQ ID
NOs: 5-8, 2) a polypeptide fragment or variant of a polypeptide comprising an amino acid
sequence of any one of SEQ ID NOs: 5-8, or 3) a polypeptide termed "homologous" to a
polypeptide comprising an amino acid sequence selected from SEQ ID NO: 5-8.

In a specific embodiment, an antibody according to the invention is directed against
1) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, 2) a
polypeptide fragment or variant of a polypeptide comprising an amino acid sequence
selected from SEQ ID NOs: 5-8, or 3) a polypeptide termed "homologous" to a polypeptide
comprising an amino acid sequence selected from SEQ ID NO: 5-8. Such an antibody is
produced by using the trioma technique or the hybridoma technique described by
Kozbor et al. (Immunology Today, (1983) 4:72).

Thus, the subject of the invention is, in addition, a method of detecting the presence
of any one of the polypeptides according to the invention in a sample, said method
comprising the steps of:

a) bringing the sample to be tested into contact with an antibody directed against 1)
a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, 2) a
polypeptide fragment or variant of a polypeptide comprising an amino acid sequence
selected from SEQ ID NOs: 5-8, or 3) a polypeptide termed "homologous" to a polypeptide
comprising an amino acid sequence of any one of SEQ ID NO: 5-8, and

b) detecting the antigen/antibody complex formed.

The invention also relates to a box or kit for diagnosis or for detecting the presence
of any one of the polypeptides in accordance with the invention in a sample, said box
comprising:

a) an antibody directed against 1) a polypeptide comprising an amino acid sequence
of any one of SEQ ID NOs: 5-8, 2) a polypeptide fragment or variant of a
polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, or 3) a
polypeptide "homologous" to a polypeptide comprising an amino acid sequence of SEQ ID
NO: 5-8, and

b) a reagent allowing the detection of the antigen/antibody complexes formed.

The invention also relates to a pharmaceutical composition comprising a nucleic
acid according to the invention.

The invention also provides pharmaceutical compositions comprising a nucleic acid
encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides according to
the invention and pharmaceutical compositions comprising any one of ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides according to the invention intended for the treatment of
diseases linked to a deficiency of cholesterol reverse transport or inflammatory lipophilic
substances transport.

The present invention also relates to a new therapeutic approach for the treatment of
pathologies linked to deficiency of cholesterol reverse transport or inflammatory lipophilic
substances transport, comprising transferring and expressing in vivo nucleic acids encoding
any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins according to the invention.

Thus, the present invention offers a new approach for the treatment and/or
prevention of pathologies linked to the abnormalities of cholesterol reverse transport or
inflammatory lipophilic substances transport. Specifically, the present invention provides
methods to restore or promote improved cholesterol reverse transport or improved
inflammatory lipophilic substances transport in a patient or subject.

Consequently, the invention also relates to a pharmaceutical composition intended
for the prevention and/or treatment of subjects affected by a dysfunction of cholesterol
reverse transport or inflammatory lipophilic substances transport, comprising a nucleic acid
encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins, in combination
with one or more physiologically compatible vehicles and/or excipients.

According to a specific embodiment of the invention, a composition is provided for
the in vivo production of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins.
This composition comprises a nucleic acid encoding any one of the ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides placed under the control of appropriate regulatory
sequences, in solution in a physiologically compatible vehicle and/or excipient.

Therefore, the present invention also relates to a composition comprising a nucleic
acid encoding a polypeptide comprising an amino acid sequence of any one of SEQ ID NO: 5-8,
wherein the nucleic acid is placed under the control of appropriate regulatory elements.

Preferably, such a composition will comprise a nucleic acid comprising a nucleotide
sequence of any one of SEQ ID NO: 1-4 and 9-126, placed under the control of appropriate
regulatory elements.

The invention also relates to a pharmaceutical composition intended for the
prevention of or treatment of subjects affected by a dysfunction of cholesterol reverse
transport or inflammatory lipophilic substances transport, comprising a recombinant vector
according to the invention, in combination with one or more physiologically compatible
vehicles and/or excipients.

According to another aspect, the subject of the invention is also a preventive or
curative therapeutic method of treating diseases caused by a deficiency of cholesterol
reverse transport or inflammatory lipophilic substances transport, such a method
comprising administering to a patient a nucleic acid encoding any one of ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides according to the invention, said nucleic acid being
combined with one or more physiologically appropriate vehicles and/or excipients.

The invention relates to a pharmaceutical composition for the prevention and/or
treatment of a patient or subject affected by a dysfunction of cholesterol reverse transport
or inflammatory lipophilic substances transport, comprising a therapeutically effective
quantity of a polypeptide having an amino acid sequence selected from SEQ ID NO: 5-8,
combined with one or more physiologically appropriate vehicles and/or excipients.

According to a specific embodiment, a method of introducing at least a nucleic acid
according to the invention into a host cell, in particular a host cell obtained from a
mammal, in vivo, comprises a step during which a preparation comprising a
pharmaceutically compatible vector and a "naked" nucleic acid according to the invention,
placed under the control of appropriate regulatory sequences, is introduced by local
injection at the level of the chosen tissue, for example a smooth muscle tissue, the "naked"
nucleic acid being absorbed by the cells of this tissue.

According to yet another aspect, the subject of the invention is also a preventive or
curative therapeutic method of treating diseases caused by a deficiency of cholesterol
reverse transport or inflammatory lipophilic substances transport, such a method
comprising administering to a patient a therapeutically effective quantity of at least one
ABCA5, ABCA6, ABCA9, or ABCA10 polypeptide according to the invention, said
polypeptide being combined with one or more physiologically appropriate vehicles and/or
excipients.

The invention also provides methods for screening small molecules and
compounds that act on any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins to
identify agonists and antagonists of such polypeptides that can restore or promote improved
cholesterol reverse transport or inflammatory lipophilic substances transport to effectively
cure and/or prevent dysfunctions thereof. These methods are useful to identify small
molecules and compounds for therapeutic use in the treatment of diseases due to a
deficiency of cholesterol reverse transport or inflammatory lipophilic substances transport.

Therefore, the invention also relates to the use of any one of ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides or a cell expressing any one of ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides according to the invention, for screening active
ingredients for the prevention and/or treatment of diseases resulting from a dysfunction
of cholesterol reverse transport or inflammatory lipophilic substances transport.

The invention also relates to a method of screening a compound or small molecule,
an agonist or antagonist of any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, said method comprising the following steps:

a) preparing a membrane vesicle comprising any one of ABCA5, ABCA6, ABCA9,
and ABCA10 polypeptides and a lipid substrate comprising a detectable marker;

b) incubating the vesicle obtained in step a) with an agonist or antagonist candidate
compound;

c) qualitatively and/or quantitatively measuring release of the lipid substrate
comprising a detectable marker; and

d) comparing the release measurement obtained in step c) with a measurement of
release of a labelled lipid substrate by a vesicle that has not been previously incubated with
the agonist or antagonist candidate compound.

In a first specific embodiment, the ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides comprise SEQ ID NO: 5-8, respectively.
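
The comparison between treated and untreated vesicles in the method above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `classify_candidate`, the 20% threshold, and the reading of increased release as "agonist-like" and decreased release as "antagonist-like" are hypothetical choices made for the example and are not specified in the present description.

```python
def classify_candidate(release_with_compound, release_control, threshold=0.2):
    """Compare labelled-substrate release with and without a candidate compound.

    Both release values are assumed to be in the same arbitrary units
    (for example, counts of the detectable marker). The threshold is a
    hypothetical cut-off on the relative change, not a value from the text.
    """
    if release_control <= 0:
        raise ValueError("control release must be positive")
    ratio = release_with_compound / release_control
    if ratio >= 1 + threshold:
        return "agonist-like"      # assumed reading: release increased
    if ratio <= 1 - threshold:
        return "antagonist-like"   # assumed reading: release decreased
    return "no clear effect"


if __name__ == "__main__":
    print(classify_candidate(150.0, 100.0))
    print(classify_candidate(50.0, 100.0))
    print(classify_candidate(105.0, 100.0))
```

In practice the cut-off would be set from replicate measurements and their variability rather than fixed in advance.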

The invention also relates to a method of screening a compound or small molecule,
an agonist or antagonist of any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, said method comprising the following steps:

a) obtaining a cell, for example a cell line, that, either naturally or after transfecting
the cell with any one of ABCA5, ABCA6, ABCA9, and ABCA10 encoding nucleic
acids, expresses the corresponding ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides;

b) incubating the cell of step a) in the presence of an anion labelled with a
detectable marker;

c) washing the cell of step b) in order to remove the excess of the labelled anion
which has not penetrated into these cells;

d) incubating the cell obtained in step c) with an agonist or antagonist candidate
compound for any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides;

e) measuring efflux of the labelled anion; and

f) comparing the value of efflux of the labelled anion determined in step e) with a
value of efflux of a labelled anion measured with a cell which has not been previously
incubated in the presence of the agonist or antagonist candidate compound for any one of
the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides.

In a first specific embodiment, the ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides comprise SEQ ID NO: 5-8, respectively.

The invention also relates to a method of screening a compound or small molecule
active as an agonist or antagonist of any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, said method comprising the following steps:

a) culturing cells of a human monocytic line in an appropriate culture medium, in
the presence of purified human albumin;

b) incubating the cells of step a) simultaneously in the presence of a compound
stimulating the production of IL-1 beta and of the agonist or antagonist candidate
compound;

c) incubating the cells obtained in step b) in the presence of an appropriate
concentration of ATP;

d) measuring IL-1 beta released into the cell culture supernatant; and

e) comparing the value of the release of the IL-1 beta obtained in step d) with the
value of the IL-1 beta released into the culture supernatant of cells which have not been
previously incubated in the presence of the agonist or antagonist candidate compound.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1: represents the map of the 17q24 region containing the ABCA5, 6, 8-10
genes. A physical map of the portion of chromosome 17q24 is shown
containing the 5 ABCA genes. Location of the microsatellite marker
D17S940 locus is indicated, along with the boundaries of BAC clones
hRPK.235_U0 and hRPK.293jeL20 (GenBank accession #s AC005495,
AC005922). Gene orientation is indicated by the arrows, and the size and
location of the corresponding transcripts is shown below the map. •
indicates the initiation codon; | represents the stop codon; — symbolizes the
working draft sequences.

Figure 2: represents the alignment of ABC1-like genes. An alignment of the amino
acid sequence of the full-length ABCA6, 8, 9, and 10 open reading
frames, and the partial sequence of ABCA5 and ABCA10 is shown as
aligned to ABCA1.

Figure 3: Maximum parsimony tree of ABC1-like genes. Phylogenetic trees were
constructed with the alignments of the N- and C-terminal ATP-binding
domains' sequence by both neighbor joining and maximum parsimony
methods.

Figure 4: Northern blot analysis of poly(A)+ RNA from 20 human tissues: pancreas
(lane 1), kidney (2), skeletal muscle (3), liver (4), lung (5), placenta (6),
brain (7), heart (8), leukocyte (9), colon (10), small intestine (11), ovary
(12), testis (13), prostate (14), thymus (15), spleen (16), fetal kidney (17),
fetal liver (18), fetal lung (19) and fetal brain (20). Hybridization with a
probe specific for ABCA5 (A), ABCA6 (B), ABCA9 (C), and ABCA10 (D).

Figure 5: displays the pattern of expression of the gene encoding the ABCA9 protein 
by in situ hybridization using an antisense ABCA9 RNA probe on a normal 
renal artery section showing medial smooth muscle (60X). 

Figure 6: displays the pattern of expression of the gene encoding the ABCA9 protein 
by in situ hybridization using an antisense ABCA9 RNA probe on a normal 
renal artery section showing adventitial nerve and Schwann cells (60X). 



Figure 7: displays the pattern of expression of the gene encoding the ABCA9 protein
by in situ hybridization using an antisense ABCA9 RNA probe on a normal
renal artery section showing adjacent ganglions and Schwann cells (60X).

Figure 8: displays the pattern of expression of the gene encoding the ABCA9 protein
by in situ hybridization using an antisense ABCA9 RNA probe on an
adjacent kidney section showing a collecting duct epithelium (60X).

Figure 9: displays the pattern of expression of the gene encoding the ABCA9 protein 
by in situ hybridization using an antisense ABCA9 RNA probe on an 
adjacent kidney section showing a renal tubular epithelium (60X).

Figure 10: displays the pattern of expression of the gene encoding the ABCA9 protein 
by in situ hybridization using an antisense ABCA9 RNA probe on a section 
of normal heart showing cardiac myocytes (60X). 

Figure 11: displays the pattern of expression of the gene encoding the ABCA9 protein
by in situ hybridization using an antisense ABCA9 RNA probe on a section
of normal heart showing interstitial vascular endothelial cells and fibroblasts
(60X).

Figure 12: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a section
of arterial tissues showing an adjacent lymph node, lymphocytes and
macrophages (60X).

Figure 13: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a section
of arterial tissues showing Schwann cells in a nerve (60X).



Figure 14: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a
myocardial tissue section showing myointimal cells in an atheroma (60X).

Figure 15: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a tissue
section adjacent to a myocardial tissue, showing a ganglion and Schwann
cells (60X).

Figure 16: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a
skeletal tissue section showing macrophages (60X).

Figure 17: displays the pattern of expression of the gene encoding the ABCA10 protein
by in situ hybridization using an antisense ABCA10 RNA probe on a
skeletal tissue section showing Schwann cells in a nerve (60X).

DETAILED DESCRIPTION OF THE INVENTION 

GENERAL DEFINITIONS 

The present invention contemplates isolation of human genes encoding ABCA5,
ABCA6, ABCA9, and ABCA10 polypeptides of the invention, including full-length or
naturally occurring forms of ABCA5, ABCA6, ABCA9, and ABCA10 and any antigenic
fragments thereof from any animal, particularly mammalian or avian, and more particularly
human source.

In accordance with the present invention, conventional molecular biology,
microbiology, and recombinant DNA techniques within the skill of the art are used. Such
techniques are fully explained in the literature (Sambrook et al., 1989, Molecular cloning: a
laboratory manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York;
Glover, 1985, DNA Cloning: A practical approach, volumes I and II; Oligonucleotide
synthesis, IRL Press, Ltd., Oxford, U.K.; Hames and Higgins, 1985, Transcription and
translation; Hames and Higgins, 1984, Animal Cell Culture; Freshney, 1986, Immobilized
Cells And Enzymes, IRL Press; and Perbal, 1984, A practical guide to molecular cloning).

As used herein, the term "gene" refers to an assembly of nucleotides that encode a
polypeptide, and includes cDNA and genomic DNA nucleic acids.

The term "isolated" for the purposes of the present invention designates a biological
material (nucleic acid or protein) which has been removed from its original environment
(the environment in which it is naturally present).

For example, a polynucleotide present in the natural state in a plant or an animal is
not isolated. The same polynucleotide separated from the adjacent nucleic acids in which it is
naturally inserted in the genome of the plant or animal is considered as being "isolated".

Such a polynucleotide may be included in a vector and/or such a polynucleotide
may be included in a composition and remains nevertheless in the isolated state because of
the fact that the vector or the composition does not constitute its natural environment.

The term "purified" does not require the material to be present in a form exhibiting
absolute purity, exclusive of the presence of other compounds. It is rather a relative
definition.

A polynucleotide is in the "purified" state after purification of the starting material
or of the natural material by at least one order of magnitude, preferably 2 or 3 and
more preferably 4 or 5 orders of magnitude.
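
As a worked illustration of this relative definition, the short Python sketch below expresses a purification as a number of orders of magnitude. It assumes, purely for the example, that purity is quantified as the fraction of the material of interest in the starting and final preparations; the description does not prescribe how the enrichment is measured, and the function names are hypothetical.

```python
import math


def purification_orders(initial_fraction, final_fraction):
    """Return the enrichment achieved, in orders of magnitude (log10 of the fold change).

    Fractions are assumed to lie in (0, 1], e.g. mass of target / total mass.
    """
    if not (0 < initial_fraction <= 1 and 0 < final_fraction <= 1):
        raise ValueError("fractions must lie in (0, 1]")
    return math.log10(final_fraction / initial_fraction)


def is_purified(initial_fraction, final_fraction, min_orders=1.0):
    """True when the enrichment reaches at least `min_orders` orders of magnitude."""
    return purification_orders(initial_fraction, final_fraction) >= min_orders


if __name__ == "__main__":
    # A target going from 0.01% to 10% of the preparation is enriched about 1000-fold.
    print(round(purification_orders(1e-4, 1e-1), 2))
    print(is_purified(1e-4, 1e-1))
    print(is_purified(0.01, 0.05))
```

Under this assumed measure, a 1000-fold enrichment corresponds to about 3 orders of magnitude and falls within the first preferred range stated above.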

For the purposes of the present description, the expression "nucleotide sequence" 
may be used to designate either a polynucleotide or a nucleic acid. The expression 
"nucleotide sequence" covers the genetic material itself and is therefore not restricted to the 
information relating to its sequence. 

The terms "nucleic acid", "polynucleotide", "oligonucleotide" or "nucleotide
sequence" cover RNA, DNA, or cDNA sequences or alternatively RNA/DNA hybrid
sequences of more than one nucleotide, either in the single-stranded form or in the duplex,
double-stranded form.

A "nucleic acid" is a polymeric compound comprised of covalently linked subunits
called nucleotides. Nucleic acid includes polyribonucleic acid (RNA) and
polydeoxyribonucleic acid (DNA), both of which may be single-stranded or double-
stranded. DNA includes cDNA, genomic DNA, synthetic DNA, and semi-synthetic DNA.
The sequence of nucleotides that encodes a protein is called the sense sequence or coding
sequence.

The term "nucleotide" designates both the natural nucleotides (A, T, G, C) as well
as the modified nucleotides that comprise at least one modification such as (1) an analog of
a purine, (2) an analog of a pyrimidine, or (3) an analogous sugar, examples of such
modified nucleotides being described, for example, in the PCT application
No. WO 95/04064.

For the purposes of the present invention, a first polynucleotide is considered as
being "complementary" to a second polynucleotide when each base of the first polynucleotide
is paired with the complementary base of the second polynucleotide whose orientation is
reversed. The complementary bases are A and T (or A and U), or C and G.
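
By way of non-limiting illustration, the complementarity rule stated above may be sketched in Python as follows. The function names `reverse_complement` and `is_complementary` are hypothetical and are introduced only for this example; the sketch assumes unmodified DNA bases (A, T, G, C) and does not handle RNA or modified nucleotides.

```python
# Illustrative sketch only; the names below are hypothetical.
# Base pairs for DNA as stated in the text: A with T, C with G.
PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the strand that pairs with seq, written in reversed orientation."""
    return "".join(PAIR[base] for base in reversed(seq.upper()))


def is_complementary(first: str, second: str) -> bool:
    """True when every base of `first` pairs with `second` read in reverse."""
    return second.upper() == reverse_complement(first)
```

For example, under these assumptions `is_complementary("ATGC", "GCAT")` evaluates to true, because each base of the first sequence pairs with the second sequence once its orientation is reversed.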

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a 
chromosomal site of the cell. Preferably, the heterologous DNA includes a gene foreign to 
the cell. 

As used herein, the term "homologous" in all its grammatical forms and spelling
variations refers to the relationship between proteins that possess a "common evolutionary
origin," including proteins from superfamilies (e.g., the immunoglobulin superfamily) and
homologous proteins from different species (e.g., myosin light chain, etc.) (Reeck et al.,
1987, Cell 50:667). Such proteins (and their encoding genes) have sequence homology,
as reflected by their high degree of sequence similarity.

Accordingly, the term "sequence similarity" in all its grammatical forms refers to
the degree of identity or correspondence between nucleic acid or amino acid sequences of
proteins that may or may not share a common evolutionary origin (see Reeck et al., supra).
However, in common usage and in the instant application, the term "homologous," when
modified with an adverb such as "highly," may refer to sequence similarity and not a
common evolutionary origin.

In a specific embodiment, two DNA sequences are "substantially homologous" or
"substantially similar" when at least about 50% (preferably at least about 75%, and more
preferably at least about 90 or 95%) of the nucleotides match over the defined length of the
DNA sequences. Sequences that are substantially homologous can be identified by
comparing the sequences using standard software available in sequence data banks, or in a
Southern hybridization experiment under, for example, stringent conditions as defined for
that particular system. Defining appropriate hybridization conditions is within the skill of
the art. See, e.g., Maniatis et al., supra; Glover et al. (1985, DNA Cloning: A Practical
Approach, Volumes I and II, Oligonucleotide Synthesis, MRL Press, Ltd., Oxford, U.K.);
Hames and Higgins (1985, Transcription and Translation).

Similarly, in a particular embodiment, two amino acid sequences are "substantially
homologous" or "substantially similar" when greater than 30% of the amino acids are
identical, or greater than about 60% are similar (functionally identical). Preferably, the
similar or homologous sequences are identified by alignment using, for example, the GCG
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison,
Wisconsin) pileup program.

The "percentage identity" between two nucleotide or amino acid sequences, for the
purposes of the present invention, may be determined by comparing two sequences aligned
optimally, through a window for comparison.

The portion of the nucleotide or polypeptide sequence in the window for
comparison may thus comprise additions or deletions (for example "gaps") relative to the
reference sequence (which does not comprise these additions or these deletions) so as to
obtain an optimum alignment of the two sequences.

The percentage is calculated by determining the number of positions at which an
identical nucleic base or an identical amino acid residue is observed for the two sequences
(nucleic or peptide) compared, and then by dividing the number of positions at which there
is identity between the two bases or amino acid residues by the total number of positions in
the window for comparison, and then multiplying the result by 100 in order to obtain the
percentage sequence identity.
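
The calculation described above may be illustrated by the following minimal Python sketch. It assumes that the two sequences have already been optimally aligned by some other tool, that gaps are written as "-", and that gapped positions count toward the total number of positions in the window for comparison; the function name `percent_identity` is hypothetical and these conventions are assumptions of the example, not requirements of the present description.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage identity of two pre-aligned sequences of equal length.

    Assumption of this sketch: '-' marks a gap, and a gapped position is
    counted in the window total but never counted as an identity.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned, equal in length, non-empty")
    # Number of positions at which the same base or residue is observed.
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    # Divide by the total number of positions in the window, times 100.
    return 100.0 * identical / len(aligned_a)
```

Under these assumptions, the aligned pair "ACGT-A" and "ACGTTA" shares five identical positions out of six, so the function returns five sixths of 100.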

The optimum sequence alignment for the comparison may be achieved using a
computer with the aid of known algorithms contained in the package from the company
WISCONSIN GENETICS SOFTWARE PACKAGE, GENETICS COMPUTER GROUP
(GCG), 575 Science Drive, Madison, WISCONSIN.

By way of illustration, it will be possible to produce the percentage sequence
identity with the aid of the BLAST software (versions BLAST 1.4.9 of March 1996,
BLAST 2.0.4 of February 1998 and BLAST 2.0.6 of September 1998), using exclusively
the default parameters (Altschul et al., 1990, J. Mol. Biol., 215:403-410; Altschul et al.,
1997, Nucleic Acids Res., 25:3389-3402). BLAST searches for sequences
similar/homologous to a reference "request" sequence, with the aid of the Altschul et al.
algorithm. The request sequence and the databases used may be of the peptide or nucleic
types, any combination being possible.

The term "corresponding to" is used herein to refer to similar or homologous
sequences, whether the exact position is identical or different from the molecule to which
the similarity or homology is measured. A nucleic acid or amino acid sequence alignment
may include spaces. Thus, the term "corresponding to" refers to the sequence similarity,
and not the numbering of the amino acid residues or nucleotide bases.

A gene encoding any one of ABCA5, ABCA6, ABCA9 and ABCA10 polypeptides
of the invention, whether genomic DNA or cDNA, can be isolated from any source,
particularly from a human cDNA or genomic library. Methods for obtaining genes are well
known in the art, as described above (see, e.g., Sambrook et al., 1989, Molecular Cloning:
A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New
York).

Accordingly, any animal cell potentially can serve as the nucleic acid source for the
molecular cloning of any one of ABCA5, ABCA6, ABCA9, and ABCA10 genes. The
DNA may be obtained by standard procedures known in the art from cloned DNA (e.g., a
DNA "library"), and preferably is obtained from a cDNA library prepared from tissues with
high level expression of the protein and/or the transcripts, by chemical synthesis, by cDNA
cloning, or by the cloning of genomic DNA, or fragments thereof, purified from the desired
cell (see, for example, Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual,
2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York; Glover, 1985, DNA
Cloning: A Practical Approach, Volumes I and II, Oligonucleotide Synthesis, MRL Press,
Ltd., Oxford, U.K.).

Clones derived from genomic DNA may contain regulatory and intron DNA regions
in addition to coding regions; clones derived from cDNA will not contain intron sequences.
Whatever the source, the gene should be molecularly cloned into a suitable vector for
propagation of the gene.

In the molecular cloning of the gene from genomic DNA, DNA fragments are
generated, some of which will encode the desired gene. The DNA may be cleaved at
specific sites using various restriction enzymes. Alternatively, one may use DNAse in the
presence of manganese to fragment the DNA, or the DNA can be physically sheared, as for
example, by sonication. The linear DNA fragments can then be separated according to size
by standard techniques, including but not limited to, agarose and polyacrylamide gel
electrophoresis and column chromatography.

Once the DNA fragments are generated, identification of the specific DNA
fragment containing the desired ABCA5, ABCA6, ABCA9, and ABCA10 genes may be
accomplished in a number of ways. For example, if an amount of a portion of one of
ABCA5, ABCA6, ABCA9, and ABCA10 genes or its specific RNA, or a fragment thereof,
is available and can be purified and labelled, the generated DNA fragments may be
screened by nucleic acid hybridization to the labelled probe (Benton and Davis, Science
(1977), 196:180; Grunstein et al., Proc. Natl. Acad. Sci. U.S.A. (1975) 72:3961). For
example, a set of oligonucleotides corresponding to the partial coding sequence
information obtained for the ABCA5, ABCA6, ABCA9, ABCA10 proteins can be
prepared and used as probes for DNA encoding any one of ABCA5, ABCA6, ABCA9 and
ABCA10, as was done in a specific example, infra, or as primers for cDNA or mRNA
(e.g., in combination with a poly-T primer for RT-PCR). Preferably, a fragment is selected
that is highly unique to one of the ABCA5, ABCA6, ABCA9, and ABCA10 nucleic acids
or polypeptide of the invention. Those DNA fragments with substantial homology to the
probe will hybridize. As noted above, the greater the degree of homology, the more
stringent hybridization conditions can be used. In a specific embodiment, various
stringency hybridization conditions are used to identify homologous ABCA5, ABCA6,
ABCA9, and ABCA10 genes.

Further selection can be carried out on the basis of the properties of the gene, e.g., if
the gene encodes a protein product having the isoelectric, electrophoretic, amino acid
composition, or partial amino acid sequence of one of the ABCA5, ABCA6, ABCA9, and
ABCA10 proteins as disclosed herein. Thus, the presence of the gene may be detected by
assays based on the physical, chemical, or immunological properties of its expressed
product. For example, cDNA clones, or DNA clones which hybrid-select the proper
mRNAs, can be selected which produce a protein that, e.g., has similar or identical
electrophoretic migration, isoelectric focusing or non-equilibrium pH gel electrophoresis
behaviour, proteolytic digestion maps, or antigenic properties as known for ABCA5,
ABCA6, ABCA9, and ABCA10.

ABCA5, ABCA6, ABCA9, and ABCA10 genes of the invention may also be
identified by mRNA selection, i.e., by nucleic acid hybridization followed by in
vitro translation. According to this procedure, nucleotide fragments are used to isolate
complementary mRNAs by hybridization. Such DNA fragments may represent available,
purified ABCA5, ABCA6, ABCA9, ABCA10 DNAs, or may be synthetic oligonucleotides
designed from the partial coding sequence information. Immunoprecipitation analysis or
functional assays (e.g., tyrosine phosphatase activity) of the in vitro translation products of
the isolated mRNAs identify the mRNA and, therefore, the
complementary DNA fragments, that contain the desired sequences. In addition, specific
mRNAs may be selected by adsorption of polysomes isolated from cells to immobilized
antibodies specifically directed against any one of the ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides of the invention.

Radiolabeled ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs can be synthesized
using the selected mRNA (from the adsorbed polysomes) as a template. The radiolabeled
mRNA or cDNA may then be used as a probe to identify homologous ABCA5, ABCA6,
ABCA9, and ABCA10 DNA fragments from among other genomic DNA fragments.

"Variant" of a nucleic acid according to the invention will be understood to mean a
nucleic acid which differs by one or more bases relative to the reference polynucleotide. A
variant nucleic acid may be of natural origin, such as an allelic variant which exists
naturally, or it may also be a nonnatural variant obtained, for example, by mutagenic
techniques.

In general, the differences between the reference (generally, wild-type) nucleic acid
and the variant nucleic acid are small such that the nucleotide sequences of the reference
nucleic acid and of the variant nucleic acid are very similar and, in many regions, identical.
The nucleotide modifications present in a variant nucleic acid may be silent, which means
that they do not alter the amino acid sequences encoded by said variant nucleic acid.

However, the changes in nucleotides in a variant nucleic acid may also result in
substitutions, additions or deletions in the polypeptide encoded by the variant nucleic acid
in relation to the polypeptides encoded by the reference nucleic acid. In addition,
nucleotide modifications in the coding regions may produce conservative or non-
conservative substitutions in the amino acid sequence of the polypeptide.
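
The distinction between a silent nucleotide modification and one that alters the encoded polypeptide may be illustrated by the following Python sketch. It assumes the standard genetic code, coding sequences supplied in frame as DNA letters, and variants that differ from the reference only by base substitutions (no additions or deletions); the names `CODON_TABLE`, `translate` and `is_silent` are hypothetical and serve this example only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ('*' = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def is_silent(reference: str, variant: str) -> bool:
    """True when the variant differs in bases but encodes the same amino acids."""
    return (
        reference.upper() != variant.upper()
        and translate(reference) == translate(variant)
    )
```

For instance, changing the codon GCC to GCT leaves alanine unchanged and is therefore silent in this sketch, whereas changing GCC to GAC replaces alanine with aspartic acid and is not.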

Preferably, the variant nucleic acids according to the invention encode polypeptides
which substantially conserve the same function or biological activity as the polypeptide of
the reference nucleic acid or alternatively the capacity to be recognized by antibodies
directed against the polypeptides encoded by the initial reference nucleic acid.

Some variant nucleic acids will thus encode mutated forms of the polypeptides
whose systematic study will make it possible to deduce structure-activity relationships of
the proteins in question. Knowledge of these variants in relation to the disease studied is
essential since it makes it possible to understand the molecular cause of the pathology.

"Fragment" will be understood to mean a nucleotide sequence of reduced length
relative to the reference nucleic acid and comprising, over the common portion, a
nucleotide sequence identical to the reference nucleic acid. Such a nucleic acid "fragment"
according to the invention may be, where appropriate, included in a larger polynucleotide
of which it is a constituent. Such fragments comprise, or alternatively consist of,
oligonucleotides ranging in length from 8, 10, 12, 15, 18, 20 to 25, 30, 40, 50, 70, 80, 100,
200, 500, 1000 or 1500 consecutive nucleotides of a nucleic acid according to the
invention.
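
The notion of a fragment as a run of consecutive nucleotides identical to part of the reference may be sketched in Python as follows. The function names `fragments` and `is_fragment` are hypothetical, and the default minimum length of 8 is an assumption taken from the shortest length listed above.

```python
from typing import Iterator


def fragments(reference: str, length: int) -> Iterator[str]:
    """Yield every run of `length` consecutive nucleotides of the reference."""
    for start in range(len(reference) - length + 1):
        yield reference[start:start + length]


def is_fragment(candidate: str, reference: str, min_length: int = 8) -> bool:
    """True when candidate is shorter than, and identical to part of, the reference."""
    return (
        min_length <= len(candidate) < len(reference)
        and candidate.upper() in reference.upper()
    )
```

A reference of six nucleotides thus yields three distinct windows of four consecutive nucleotides, and any candidate that is of reduced length and matches the reference exactly over its whole length qualifies as a fragment in this sketch.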

A "nucleic acid molecule" refers to the phosphate ester polymeric form of
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as
phosphorothioates and thioesters, in either single stranded form, or a double-stranded helix.
Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The term
nucleic acid molecule, and in particular DNA or RNA molecule, refers only to the primary
and secondary structure of the molecule, and does not limit it to any particular tertiary
forms. Thus, this term includes double-stranded DNA found, inter alia, in linear or
circular DNA molecules (e.g., restriction fragments), plasmids, and chromosomes. In
discussing the structure of particular double-stranded DNA molecules, sequences may be
described herein according to the normal convention of giving only the sequence in the 5'
to 3' direction along the nontranscribed strand of DNA (i.e., the strand having a sequence
homologous to the mRNA). A "recombinant DNA molecule" is a DNA molecule that has
undergone a molecular biological manipulation.

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as
a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule
can anneal to the other nucleic acid molecule under the appropriate conditions of
temperature and solution ionic strength (see Sambrook et al., supra). The conditions of
temperature and ionic strength determine the "stringency" of the hybridization. For
preliminary screening for homologous nucleic acids, low stringency hybridization
conditions, corresponding to a Tm of 55°, can be used, e.g., 5x SSC, 0.1% SDS, 0.25%
milk, and no formamide; or 30% formamide, 5x SSC, 0.5% SDS. Moderate stringency
hybridization conditions correspond to a higher Tm, e.g., 40% formamide, with 5x or 6x
SSC. High stringency hybridization conditions correspond to the highest Tm, e.g., 50%
formamide, 5x or 6x SSC. Hybridization requires that the two nucleic acids contain
complementary sequences, although depending on the stringency of the hybridization,
mismatches between bases are possible. The appropriate stringency for hybridizing nucleic
acids depends on the length of the nucleic acids and the degree of complementation,
variables well known in the art. The greater the degree of similarity or homology between
two nucleotide sequences, the greater the value of Tm for hybrids of nucleic acids having
those sequences. The relative stability (corresponding to higher Tm) of nucleic acid
hybridizations decreases in the following order: RNA:RNA, DNA:RNA, DNA:DNA. For
hybrids of greater than 100 nucleotides in length, equations for calculating Tm have been
derived (see Sambrook et al., supra). For hybridization with shorter nucleic acids, i.e.,
oligonucleotides, the position of mismatches becomes more important, and the length of
the oligonucleotide determines its specificity (see Sambrook et al., supra). Preferably a
minimum length for a hybridizable nucleic acid is at least about 10 nucleotides; preferably
at least about 15 nucleotides; and more preferably the length is at least about 20
nucleotides.
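
The present description does not reproduce the equations referred to above. As a hedged illustration only, the following Python sketch implements one commonly cited empirical approximation for DNA:DNA hybrids longer than 100 nucleotides, Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%G+C) - 0.63 (% formamide) - 600/length. The choice of this particular equation, the function name `estimate_tm` and its parameter names are assumptions of the example; other published variants use slightly different coefficients.

```python
import math


def estimate_tm(length: int, gc_percent: float, sodium_molar: float,
                formamide_percent: float = 0.0) -> float:
    """Approximate Tm in degrees C for a long DNA:DNA hybrid.

    Assumed empirical form (not taken from the present description):
    81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.63*(%formamide) - 600/length
    """
    if length <= 100:
        raise ValueError("approximation intended for hybrids over 100 nucleotides")
    if sodium_molar <= 0:
        raise ValueError("sodium concentration must be positive")
    return (
        81.5
        + 16.6 * math.log10(sodium_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / length
    )
```

In this approximation, adding formamide lowers the estimated Tm of the hybrid, which is why hybridization in 50% formamide at a fixed temperature such as 42°C is more stringent than hybridization without formamide.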

In a specific embodiment, the term "standard hybridization conditions" refers to a
Tm of 55°C, and utilizes conditions as set forth above. In a preferred embodiment, the Tm
is 60°C; in a more preferred embodiment, the Tm is 65°C.

"High stringency hybridization conditions" for the purposes of the present invention
will be understood to mean the following conditions:

1- Membrane competition and PREHYBRIDIZATION:

- Mix: 40 µl salmon sperm DNA (10 mg/ml)
  + 40 µl human placental DNA (10 mg/ml)

- Denature for 5 minutes at 96°C, then immerse the mixture in ice.

- Remove the 2X SSC and pour 4 ml of formamide mix in the hybridization tube
  containing the membranes.

- Add the mixture of the two denatured DNAs.

- Incubation at 42°C for 5 to 6 hours, with rotation.

2- Labeled probe competition:

- Add to the labeled and purified probe 10 to 50 µl Cot I DNA, depending on the quantity
  of repeats.

- Denature for 7 to 10 minutes at 95°C.

- Incubate at 65°C for 2 to 5 hours.

3- HYBRIDIZATION:

- Remove the prehybridization mix.

- Mix 40 µl salmon sperm DNA + 40 µl human placental DNA; denature for 5 min at
  96°C, then immerse in ice.

- Add to the hybridization tube 4 ml of formamide mix, the mixture of the two DNAs and
  the denatured labeled probe/Cot I DNA.

- Incubate 15 to 20 hours at 42°C, with rotation.

4- Washes and Exposure:

- One wash at room temperature in 2X SSC, to rinse.

- Twice 5 minutes at room temperature 2X SSC and 0.1% SDS at 65°C.

- Twice 15 minutes 0.1X SSC and 0.1% SDS at 65°C.

- Envelop the membranes in clear plastic wrap and expose.

The hybridization conditions described above are adapted to hybridization, under
high stringency conditions, of a molecule of nucleic acid of varying length from
20 nucleotides to several hundreds of nucleotides. It goes without saying that the
hybridization conditions described above may be adjusted as a function of the length of the
nucleic acid whose hybridization is sought or of the type of labeling chosen, according to
techniques known to one skilled in the art. Suitable hybridization conditions may, for
example, be adjusted according to the teaching contained in the manual by Hames and
Higgins (1985, supra).

As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of at
least 15 nucleotides, that is hybridizable to a nucleic acid according to the invention.
Oligonucleotides can be labelled, e.g., with 32P-nucleotides or nucleotides to which a label,
such as biotin, has been covalently conjugated. In one embodiment, a labeled
oligonucleotide can be used as a probe to detect the presence of a nucleic acid encoding an
ABCA5-6, 9-10 polypeptide of the invention. In another embodiment, oligonucleotides
(one or both of which may be labelled) can be used as PCR primers, either for cloning full
lengths or fragments of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 nucleic
acids, or to detect the presence of nucleic acids encoding any one of the ABCA5, ABCA6,
ABCA9, and ABCA10. In a further embodiment, an oligonucleotide of the invention can
form a triple helix with any one of the ABCA5, ABCA6, ABCA9, and ABCA10 DNA
molecules. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic
acid synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally
occurring phosphoester analog bonds, such as thioester bonds, etc.

"Homologous recombination" refers to the insertion of a foreign DNA sequence of
a vector in a chromosome. Preferably, the vector targets a specific chromosomal site for
homologous recombination. For specific homologous recombination, the vector will
contain sufficiently long regions of homology to sequences of the chromosome to allow
complementary binding and incorporation of the vector into the chromosome. Longer
regions of homology, and greater degrees of sequence similarity, may increase the
efficiency of homologous recombination.

A DNA "coding sequence" is a double-stranded DNA sequence which is
transcribed and translated into a polypeptide in a cell in vitro or in vivo when placed under
the control of appropriate regulatory sequences. The boundaries of the coding sequence are
determined by a start codon at the 5' (amino) terminus and a translation stop codon at the
3' (carboxyl) terminus. A coding sequence can include, but is not limited to, prokaryotic
sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g.,
mammalian) DNA, and even synthetic DNA sequences. If the coding sequence is intended
for expression in a eukaryotic cell, a polyadenylation signal and transcription termination
sequence will usually be located 3' to the coding sequence.
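
The way in which a start codon and an in-frame stop codon bound a coding sequence may be illustrated by the following minimal Python sketch. It assumes an intron-free DNA sequence read 5' to 3' on the nontranscribed strand, the start codon ATG and the stop codons TAA, TAG and TGA; the function name `find_coding_sequence` is hypothetical, and the sketch does not attempt to model regulatory sequences, splicing or the complementary strand.

```python
from typing import Optional, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequence(dna: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the first ATG ... in-frame stop, or None.

    `start` indexes the A of the start codon; `end` is one past the last
    base of the stop codon, so dna[start:end] is the bounded sequence.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon in the frame set by this start codon.
        for pos in range(start + 3, len(dna) - 2, 3):
            if dna[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
        start = dna.find("ATG", start + 1)
    return None
```

Applied to the sequence "CCATGGCCTAAGG", this sketch reports the boundaries of "ATGGCCTAA", that is, the start codon, one alanine codon and the stop codon.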

Transcriptional and translational control sequences are DNA regulatory sequences,
such as promoters, enhancers, terminators, and the like, that provide for the expression of a
coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are control
sequences.

"Regulatory region" means a nucleic acid sequence which regulates the expression
of a nucleic acid. A regulatory region may include sequences which are naturally
responsible for expressing a particular nucleic acid (a homologous region) or may include
sequences of a different origin (responsible for expressing different proteins or even
synthetic proteins). In particular, the sequences can be sequences of eukaryotic or viral
genes or derived sequences which stimulate or repress transcription of a gene in a specific
or non-specific manner and in an inducible or non-inducible manner. Regulatory regions
include origins of replication, RNA splice sites, enhancers, transcriptional termination
sequences, signal sequences which direct the polypeptide into the secretory pathways of the
target cell, and promoters.

A regulatory region from a "heterologous source" is a regulatory region which is
not naturally associated with the expressed nucleic acid. Included among the heterologous
regulatory regions are regulatory regions from a different species, regulatory regions from a
different gene, hybrid regulatory sequences, and regulatory sequences which do not occur
in nature, but which are designed by one having ordinary skill in the art.

A "cassette" refers to a segment of DNA that can be inserted into a vector at 
specific restriction sites. The segment of DNA encodes a polypeptide of interest, and the 
cassette and restriction sites are designed to ensure insertion of the cassette in the proper 
reading frame for transcription and translation. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA
polymerase in a cell and initiating transcription of a downstream (3' direction) coding
sequence. For purposes of defining the present invention, the promoter sequence is
bounded at its 3' terminus by the transcription initiation site and extends upstream (5'
direction) to include the minimum number of bases or elements necessary to initiate
transcription at levels detectable above background. Within the promoter sequence will be
found a transcription initiation site (conveniently defined, for example, by mapping with
nuclease S1), as well as protein binding domains (consensus sequences) responsible for the
binding of RNA polymerase.

A coding sequence is "under the control" of transcriptional and translational control
sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA,
which is then trans-RNA spliced and translated into the protein encoded by the coding
sequence.

A "signal sequence" is included at the beginning of the coding sequence of a protein
to be expressed on the surface of a cell. This sequence encodes a signal peptide, N-
terminal to the mature polypeptide, that directs the host cell to translocate the polypeptide.
The term "translocation signal sequence" is used herein to refer to this sort of signal
sequence. Translocation signal sequences can be found associated with a variety of
proteins native to eukaryotes and prokaryotes, and are often functional in both types of
organisms.

A "polypeptide" is a polymeric compound comprised of covalently linked amino
acid residues. Amino acids have the following general structure:

    H2N-CH(R)-COOH

Amino acids are classified into seven groups on the basis of the side chain R: (1) aliphatic
side chains, (2) side chains containing a hydroxylic (OH) group, (3) side chains containing
sulfur atoms, (4) side chains containing an acidic or amide group, (5) side chains
containing a basic group, (6) side chains containing an aromatic ring, and (7) proline, an
imino acid in which the side chain is fused to the amino group.

A "protein" is a polypeptide which plays a structural or functional role in a living
cell.

The polypeptides and proteins of the invention may be glycosylated or
unglycosylated.

"Homology" means similarity of sequence reflecting a common evolutionary origin.
Polypeptides or proteins are said to have homology, or similarity, if a substantial number of
their amino acids are either (1) identical, or (2) have a chemically similar R side chain.
Nucleic acids are said to have homology if a substantial number of their nucleotides are
identical.

"Isolated polypeptide" or "isolated protein" is a polypeptide or protein which is
substantially free of those compounds that are normally associated therewith in its natural
state (e.g., other proteins or polypeptides, nucleic acids, carbohydrates, lipids). "Isolated"
is not meant to exclude artificial or synthetic mixtures with other compounds, or the
presence of impurities which do not interfere with biological activity, and which may be
present, for example, due to incomplete purification, addition of stabilizers, or
compounding into a pharmaceutically acceptable preparation.

"Fragment" of a polypeptide according to the invention will be understood to mean
a polypeptide whose amino acid sequence is shorter than that of the reference polypeptide
and which comprises, over the entire portion with these reference polypeptides, an identical
amino acid sequence. Such fragments may, where appropriate, be included in a larger
polypeptide of which they are a part. Such fragments of a polypeptide according to the
invention may have a length of 5, 10, 15, 20, 30 to 40, 50, 100, 200 or 300 amino acids.

"Variant" of a polypeptide according to the invention will be understood to mean
mainly a polypeptide whose amino acid sequence contains one or more substitutions,
additions or deletions of at least one amino acid residue, relative to the amino acid
sequence of the reference polypeptide, it being understood that the amino acid substitutions
may be either conservative or nonconservative.

A "variant" of a polypeptide or protein is any analogue, fragment, derivative, or
mutant which is derived from a polypeptide or protein and which retains at least one
biological property of the polypeptide or protein. Different variants of the polypeptide or
protein may exist in nature. These variants may be allelic variations characterized by
differences in the nucleotide sequences of the structural gene coding for the protein, or may
involve differential splicing or post-translational modification. Variants also include a
related protein having substantially the same biological activity, but obtained from a
different species.

The skilled artisan can produce variants having single or multiple amino acid
substitutions, deletions, additions, or replacements. These variants may include, inter alia:
(a) variants in which one or more amino acid residues are substituted with conservative or
non-conservative amino acids, (b) variants in which one or more amino acids are added to
the polypeptide or protein, (c) variants in which one or more of the amino acids includes a
substituent group, and (d) variants in which the polypeptide or protein is fused with another
polypeptide such as serum albumin. The techniques for obtaining these variants, including
genetic (suppressions, deletions, mutations, etc.), chemical, and enzymatic techniques, are
known to persons having ordinary skill in the art.

If such allelic variations, analogues, fragments, derivatives, mutants, and
modifications, including alternative mRNA splicing forms and alternative post-
translational modification forms result in derivatives of the polypeptide which retain any of
the biological properties of the polypeptide, they are intended to be included within the
scope of this invention.

A "vector" is a replicon, such as plasmid, virus, phage or cosmid, to which another
DNA segment may be attached so as to bring about the replication of the attached segment.
A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as an
autonomous unit of DNA replication in vivo, i.e., capable of replication under its own
control.

The present invention also relates to cloning vectors containing genes encoding
analogs and derivatives of any of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides
of the invention, that have the same or homologous functional activity as that of ABCA5,
ABCA6, ABCA9, ABCA10 polypeptides, and homologs thereof from other species. The
production and use of derivatives and analogs related to ABCA5, ABCA6, ABCA9, and
ABCA10 are within the scope of the present invention. In a specific embodiment, the
derivatives or analogs are functionally active, i.e., capable of exhibiting one or more
functional activities associated with a full-length, wild-type ABCA5, ABCA6, ABCA9,
and ABCA10 polypeptides of the invention.

ABCA5, ABCA6, ABCA9, and ABCA10 derivatives can be made by altering
encoding nucleic acid sequences by substitutions, additions or deletions that provide for
functionally equivalent molecules. Preferably, derivatives are made that have enhanced or
increased functional activity relative to native ABCA5, ABCA6, ABCA9, and ABCA10.
Alternatively, such derivatives may encode soluble fragments of the ABCA5, ABCA6,
ABCA9, and ABCA10 extracellular domains that have the same or greater affinity for the
natural ligand of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides of the invention.
Such soluble derivatives may be potent inhibitors of ligand binding to ABCA5, ABCA6,
ABCA9, and ABCA10.

Due to the degeneracy of nucleotide coding sequences, other DNA sequences which
encode substantially the same amino acid sequences as that of ABCA5, ABCA6, ABCA9, and
ABCA10 genes may be used in the practice of the present invention. These include but are
not limited to allelic genes, homologous genes from other species, and nucleotide
sequences comprising all or portions of ABCA5, ABCA6, ABCA9, and ABCA10 genes
which are altered by the substitution of different codons that encode the same amino acid
residue within the sequence, thus producing a silent change. Likewise, the ABCA5,
ABCA6, ABCA9, and ABCA10 derivatives of the invention include, but are not limited to,
those containing, as a primary amino acid sequence, all or part of the amino acid sequence
of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins including altered
sequences in which functionally equivalent amino acid residues are substituted for residues
within the sequence resulting in a conservative amino acid substitution. For example, one
or more amino acid residues within the sequence can be substituted by another amino acid
of a similar polarity, which acts as a functional equivalent, resulting in a silent alteration.
Substitutes for an amino acid within the sequence may be selected from other members of
the class to which the amino acid belongs. For example, the nonpolar (hydrophobic) amino
acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and
methionine. Amino acids containing aromatic ring structures are phenylalanine,
tryptophan, and tyrosine. The polar neutral amino acids include glycine, serine, threonine,
cysteine, tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids
include arginine, lysine and histidine. The negatively charged (acidic) amino acids include
aspartic acid and glutamic acid. Such alterations will not be expected to affect apparent
molecular weight as determined by polyacrylamide gel electrophoresis, or isoelectric point.
Particularly preferred substitutions are:

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 
1 5 - Gin for Asn such that a free CONH2 can be maintained. 
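
The residue classes listed above can be turned into a simple lookup for screening candidate substitutions. The following Python sketch is illustrative only: the one-letter codes are the standard ones, but the function names are hypothetical, and shared class membership is only a rough proxy for functional equivalence, which must still be confirmed experimentally.

```python
# Residue classes as named in the text, written with standard one-letter codes.
# Tyrosine (Y) is listed as both aromatic and polar neutral, and phenylalanine
# and tryptophan as both nonpolar and aromatic, so classes may overlap.
CLASSES = {
    "nonpolar": set("ALIVPFWM"),
    "aromatic": set("FWY"),
    "polar_neutral": set("GSTCYNQ"),
    "basic": set("RKH"),
    "acidic": set("DE"),
}


def shared_classes(a, b):
    """Return the names of all classes that contain both residues."""
    a, b = a.upper(), b.upper()
    return {name for name, members in CLASSES.items() if a in members and b in members}


def is_conservative(a, b):
    """True if a and b are different residues sharing at least one class."""
    return a.upper() != b.upper() and bool(shared_classes(a, b))
```

Under this lookup, each of the particularly preferred pairs (Lys/Arg, Glu/Asp, Ser/Thr, Gln/Asn) is reported as conservative, whereas a charge-reversing pair such as Lys/Asp is not.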

Amino acid substitutions may also be introduced to substitute an amino acid with a 
particularly preferable property. For example, a Cys may be introduced as a potential site 
for disulfide bridges with another Cys. A His may be introduced as a particularly 
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in 
biochemical catalysis). Pro may be introduced because of its particularly planar structure, 
which induces β-turns in the protein's structure. 

The genes encoding ABCA5, ABCA6, ABCA9, and ABCA10 derivatives and 
analogs of the invention can be produced by various methods known in the art. The 
manipulations which result in their production can occur at the gene or protein level. For 
example, the cloned ABCA5, ABCA6, ABCA9, and ABCA10 sequences can be modified 
by any of numerous strategies known in the art (Sambrook et al., 1989, supra). The 
sequence can be cleaved at appropriate sites with restriction endonuclease(s), followed by 
further enzymatic modification if desired, isolated, and ligated in vitro. Production of a 
gene encoding a derivative or analog of any one of ABCA5, ABCA6, ABCA9, and 
ABCA10 should ensure that the modified gene remains within the same translational 
reading frame as the ABCA5, ABCA6, ABCA9, and ABCA10 genes, uninterrupted by 
translational stop signals, in the region where the desired activity is encoded. 

Additionally, the ABCA5, ABCA6, ABCA9, and ABCA10-encoding nucleic acids 
can be mutated in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or form new restriction 
endonuclease sites or destroy pre-existing ones, to facilitate further in vitro modification. 
Preferably, such mutations enhance the functional activity of the mutated ABCA5, 
ABCA6, ABCA9, and ABCA10 gene products. Any technique for mutagenesis known in 
the art may be used, including inter alia, in vitro site-directed mutagenesis (Hutchinson et 
al., (1978) J. Biol. Chem. 253:6551; Zoller and Smith, (1984) DNA, 3:479-488; Oliphant et 
al., (1986) Gene 44:177; Hutchinson et al., (1986) Proc. Natl. Acad. Sci. U.S.A. 83:710; 
Huygen et al., (1996) Nature Medicine, 2(8):893-898) and use of TAB® linkers 
(Pharmacia). PCR techniques are preferred for site-directed mutagenesis (Higuchi, 1989, 
"Using PCR to Engineer DNA", in PCR Technology: Principles and Applications for DNA 
Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70). 
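
The requirement that a modified gene stay in the same translational reading frame, uninterrupted by stop signals, can be checked computationally before any construct is made. The Python sketch below is a minimal illustration under simplifying assumptions: the sequence is taken to start at the first base of a codon, the standard stop codons (TAA, TAG, TGA) apply, and a net length change that is a multiple of three is treated as frame-preserving. The function names and the example sequences in the test are hypothetical and are not taken from the ABCA sequences.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def codons(seq):
    """Split a coding sequence into complete codons, ignoring trailing bases."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def preserves_frame(original, modified):
    """True if the net length change is a multiple of three (simplified check)."""
    return (len(modified) - len(original)) % 3 == 0


def first_internal_stop(seq):
    """Return the 0-based index of the first stop codon that precedes the
    final complete codon, or None if the frame is open up to that point."""
    for index, codon in enumerate(codons(seq)[:-1]):
        if codon in STOP_CODONS:
            return index
    return None
```

A three-base insertion passes both checks, whereas a single-base insertion shifts the frame and will typically expose a premature stop codon downstream.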

Identified and isolated ABCA5, ABCA6, ABCA9, and ABCA10 genes may then be 
inserted into an appropriate cloning vector. A large number of vector-host systems known 
in the art may be used. Possible vectors include, but are not limited to, plasmids or 
modified viruses, but the vector system must be compatible with the host cell used. 
Examples of vectors include, but are not limited to, Escherichia coli, bacteriophages such 
as lambda derivatives, or plasmids such as pBR322 derivatives or pUC plasmid 
derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc. The insertion into a cloning vector 
can, for example, be accomplished by ligating the DNA fragment into a cloning vector 
which has complementary cohesive termini. However, if the complementary restriction 
sites used to fragment the DNA are not present in the cloning vector, the ends of the DNA 
molecules may be enzymatically modified. Alternatively, any site desired may be produced 
by ligating nucleotide sequences (linkers) onto the DNA termini; these ligated linkers may 
comprise specific chemically synthesized oligonucleotides encoding restriction 
endonuclease recognition sequences. Recombinant molecules can be introduced into host 
cells via transformation, transfection, infection, electroporation, etc., so that many copies of 
the gene sequence are generated. Preferably, the cloned gene is contained on a shuttle 
vector plasmid, which provides for expansion in a cloning cell, e.g., Escherichia coli, and 
facile purification for subsequent insertion into an appropriate expression cell line, if such 
is desired. For example, a shuttle vector, which is a vector that can replicate in more than 
one type of organism, can be prepared for replication in both Escherichia coli and 
Saccharomyces cerevisiae by linking sequences from an Escherichia coli plasmid with 
sequences from the yeast 2μ plasmid. 
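
Whether a fragment and a cloning vector carry the same restriction sites, as discussed above, can be checked with a simple motif search. The Python sketch below is an illustration only: it covers three well-known enzymes with palindromic recognition sequences (EcoRI, BamHI, HindIII), scans one strand, and uses hypothetical function names; a real cloning design would rely on a curated enzyme database and would also consider where each enzyme cuts.

```python
# Recognition sequences of three common enzymes; all are palindromic, so
# scanning one strand finds every site on the duplex.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}


def find_sites(seq, sites=SITES):
    """Map each enzyme name to the 0-based start positions of its sites in seq."""
    seq = seq.upper()
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        hits[name] = positions
    return hits


def shared_enzymes(fragment, vector, sites=SITES):
    """List enzymes whose recognition site occurs in both sequences."""
    in_fragment = find_sites(fragment, sites)
    in_vector = find_sites(vector, sites)
    return sorted(name for name in sites if in_fragment[name] and in_vector[name])
```

If `shared_enzymes` returns an empty list for a given fragment and vector, the text's alternatives apply: enzymatic modification of the ends, or ligation of linkers encoding the desired recognition sequence.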

In an alternative method, the desired gene may be identified and isolated after 
insertion into a suitable cloning vector in a "shot gun" approach. Enrichment for the 
desired gene, for example, by size fractionation, can be done before insertion into the 
cloning vector. 

The nucleotide sequence coding for ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides or antigenic fragments, derivatives or analogs thereof, or functionally active 
derivatives, including chimeric proteins thereof, may be inserted into an appropriate 
expression vector, i.e., a vector which contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequence. Such elements are termed herein a 
"promoter." Thus, nucleic acids encoding ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides of the invention are operationally associated with a promoter in an expression 
vector of the invention. Both cDNA and genomic sequences can be cloned and expressed 
under control of such regulatory sequences. An expression vector also preferably includes 
a replication origin. 

The necessary transcriptional and translational signals can be provided on a 
recombinant expression vector, or they may be supplied by a native gene encoding 
ABCA5, ABCA6, ABCA9, and ABCA10 and/or its flanking regions. 

Potential host-vector systems include but are not limited to mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with 
virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host-vector 
system utilized, any one of a number of suitable transcription and translation elements may 
be used. 

Recombinant ABCA5, ABCA6, ABCA9, and ABCA10 proteins of the invention, 
or functional fragments, derivatives, chimeric constructs, or analogs thereof, may be 
expressed chromosomally, after integration of the coding sequence by recombination. In 
this regard, any of a number of amplification systems may be used to achieve high levels of 
stable gene expression (see Sambrook et al., 1989, supra). 

The cell into which the recombinant vector comprising the nucleic acid encoding 
any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides according to the 
invention has been introduced is cultured in an appropriate cell culture medium under 
conditions that provide for expression of any one of the ABCA5, ABCA6, ABCA9, and 
ABCA10 polypeptides by the cell. 

Any of the methods previously described for the insertion of DNA fragments into a 
cloning vector may be used to construct expression vectors containing a gene consisting of 
appropriate transcriptional/translational control signals and the protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic techniques and in 
vivo recombination (genetic recombination). 

Expression of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides may be 
controlled by any promoter/enhancer element known in the art, but these regulatory 
elements must be functional in the host selected for expression. Promoters which may be 
used to control ABCA5, ABCA6, ABCA9, and ABCA10 gene expression include, but are 
not limited to, the SV40 early promoter region (Benoist and Chambon, 1981 Nature 
290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto, et al., 1980 Cell 22:787-797), the herpes thymidine kinase promoter (Wagner 
et al., 1981 Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982 Nature 296:39-42); prokaryotic expression 
vectors such as the β-lactamase promoter (Villa-Kamaroff, et al., 1978 Proc. Natl. Acad. 
Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et al., 1983 Proc. Natl. Acad. Sci. 
U.S.A. 80:21-25); see also "Useful proteins from recombinant bacteria" in Scientific 
American, 1980, 242:74-94; promoter elements from yeast or other fungi such as the Gal 4 
promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter; and the animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: elastase I 
gene control region which is active in pancreatic acinar cells (Swift et al., 1984 Cell 
38:639-646; Ornitz et al., 1986 Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987); insulin gene control region which is active in pancreatic beta cells 
(Hanahan, 1985 Nature 315:115-122), immunoglobulin gene control region which is 
active in lymphoid cells (Grosschedl et al., 1984 Cell 38:647-658; Adames et al., 1985 
Nature 318:533-538; Alexander et al., 1987 Mol. Cell. Biol. 7:1436-1444), mouse 
mammary tumor virus control region which is active in testicular, breast, lymphoid and 
mast cells (Leder et al., 1986 Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et al., 1987 Genes and Devel. 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al., 1985 Mol. Cell. Biol. 5:1639-1648; 
Hammer et al., 1987 Science 235:53-58), alpha 1-antitrypsin gene control region which is 
active in the liver (Kelsey et al., 1987 Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985 Nature 315:338-340; 
Kollias et al., 1986 Cell 46:89-94), myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et al., 1987 Cell 48:703-712), myosin light 
chain-2 gene control region which is active in skeletal muscle (Sani, 1985 Nature 
314:283-286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et al., 1986 Science 234:1372-1378). 

Expression vectors containing a nucleic acid encoding one of the ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides of the invention can be identified by five general 
approaches: (a) polymerase chain reaction (PCR) amplification of the desired plasmid 
DNA or specific mRNA, (b) nucleic acid hybridization, (c) presence or absence of 
selection marker gene functions, (d) analyses with appropriate restriction endonucleases, 
and (e) expression of inserted sequences. In the first approach, the nucleic acids can be 
amplified by PCR to provide for detection of the amplified product. In the second 
approach, the presence of a foreign gene inserted in an expression vector can be detected by 
nucleic acid hybridization using probes comprising sequences that are homologous to an 
inserted marker gene. In the third approach, the recombinant vector/host system can be 
identified and selected based upon the presence or absence of certain "selection marker" 
gene functions (e.g., β-galactosidase activity, thymidine kinase activity, resistance to 
antibiotics, transformation phenotype, occlusion body formation in baculovirus, etc.) 
caused by the insertion of foreign genes in the vector. In another example, if the nucleic 
acid encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides is 
inserted within the "selection marker" gene sequence of the vector, recombinants 
containing ABCA5, ABCA6, ABCA9, and ABCA10 nucleic acid inserts can be identified 
by the absence of the ABCA5, ABCA6, ABCA9, and ABCA10 gene functions. In the 
fourth approach, recombinant expression vectors are identified by digestion with 
appropriate restriction enzymes. In the fifth approach, recombinant expression vectors can 
be identified by assaying for the activity, biochemical, or immunological characteristics of 
the gene product expressed by the recombinant, provided that the expressed protein 
assumes a functionally active conformation. 

A wide variety of host/expression vector combinations may be employed in 
expressing the nucleic acids of this invention. Useful expression vectors, for example, may 
consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. 
Suitable vectors include derivatives of SV40 and known bacterial plasmids, e.g., 
Escherichia coli plasmids col E1, pCR1, pBR322, pMal-C2, pET, pGEX (Smith et al., 
1988, Gene 67:31-40), pMB9 and their derivatives, plasmids such as RP4; phage DNAs, 
e.g., the numerous derivatives of phage λ, e.g., NM989, and other phage DNA, e.g., M13 
and filamentous single stranded phage DNA; yeast plasmids such as the 2μ plasmid or 
derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in insect or 
mammalian cells; vectors derived from combinations of plasmids and phage DNAs, such 
as plasmids that have been modified to employ phage DNA or other expression control 
sequences; and the like. 

For example, in a baculovirus expression system, both non-fusion transfer vectors, 
such as but not limited to pVL941 (BamHI cloning site; Summers), pVL1393 (BamHI, 
SmaI, XbaI, EcoRI, NotI, XmaIII, BglII, and PstI cloning site; Invitrogen), pVL1392 (BglII, 
PstI, NotI, XmaIII, EcoRI, XbaI, SmaI, and BamHI cloning site; Summers and Invitrogen), 
and pBlueBacIII (BamHI, BglII, PstI, NcoI, and HindIII cloning site, with blue/white 
recombinant screening possible; Invitrogen), and fusion transfer vectors, such as but not 
limited to pAc700 (BamHI and KpnI cloning site, in which the BamHI recognition site 
begins with the initiation codon; Summers), pAc701 and pAc702 (same as pAc700, with 
different reading frames), pAc360 (BamHI cloning site 36 base pairs downstream of a 
polyhedrin initiation codon; Invitrogen (195)), and pBlueBacHisA, B, C (three different 
reading frames, with BamHI, BglII, PstI, NcoI, and HindIII cloning site, an N-terminal 
peptide for ProBond purification, and blue/white recombinant screening of plaques; 
Invitrogen (220)), can be used. 

Mammalian expression vectors contemplated for use in the invention include 
vectors with inducible promoters, such as the dihydrofolate reductase (DHFR) promoter, 
e.g., any expression vector with a DHFR expression vector, or a DHFR/methotrexate 
co-amplification vector, such as pED (PstI, SalI, SbaI, SmaI, and EcoRI cloning site, with the 
vector expressing both the cloned gene and DHFR; see Kaufman, Current Protocols in 
Molecular Biology, 16.12 (1991)). Alternatively, a glutamine synthetase/methionine 
sulfoximine co-amplification vector can be used, such as pEE14 (HindIII, XbaI, SmaI, SbaI, 
EcoRI, and BclI cloning site, in which the vector expresses glutamine synthase and the 
cloned gene; Celltech). In another embodiment, a vector that directs episomal expression 
under control of Epstein Barr Virus (EBV) can be used, such as pREP4 (BamHI, SfiI, XhoI, 
NotI, NheI, HindIII, NheI, PvuII, and KpnI cloning site, constitutive RSV-LTR promoter, 
hygromycin selectable marker; Invitrogen), pCEP4 (BamHI, SfiI, XhoI, NotI, NheI, HindIII, 
NheI, PvuII, and KpnI cloning site, constitutive hCMV immediate early gene, hygromycin 
selectable marker; Invitrogen), pMEP4 (KpnI, PvuI, NheI, HindIII, NotI, XhoI, SfiI, BamHI 
cloning site, inducible metallothionein IIa gene promoter, hygromycin selectable marker; 
Invitrogen), pREP8 (BamHI, XhoI, NotI, HindIII, NheI, and KpnI cloning site, RSV-LTR 
promoter, histidinol selectable marker; Invitrogen), pREP9 (KpnI, NheI, HindIII, NotI, 
XhoI, SfiI, and BamHI cloning site, RSV-LTR promoter, G418 selectable marker; 
Invitrogen), and pEBVHis (RSV-LTR promoter, hygromycin selectable marker, N-terminal 
peptide purifiable via ProBond resin and cleaved by enterokinase; Invitrogen). Selectable 
mammalian expression vectors for use in the invention include pRc/CMV (HindIII, BstXI, 
NotI, SbaI, and ApaI cloning site, G418 selection; Invitrogen), pRc/RSV (HindIII, SpeI, 
BstXI, NotI, XbaI cloning site, G418 selection; Invitrogen), and others. Vaccinia virus 
mammalian expression vectors (see, Kaufman, 1991, supra) for use according to the 
invention include but are not limited to pSC11 (SmaI cloning site, TK- and β-gal selection), 
pMJ601 (SalI, SmaI, AflI, NarI, BspMII, BamHI, ApaI, NheI, SacII, KpnI, and HindIII 
cloning site; TK- and β-gal selection), and pTKgptF1S (EcoRI, PstI, SalI, AccI, HindII, 
SbaI, BamHI, and Hpa cloning site, TK or XPRT selection). 

Yeast expression systems can also be used according to the invention to express any 
one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides. For example, the 
non-fusion pYES2 vector (XbaI, SphI, ShoI, NotI, GstXI, EcoRI, BstXI, BamHI, SacI, KpnI, 
and HindIII cloning site; Invitrogen) or the fusion pYESHisA, B, C (XbaI, SphI, ShoI, NotI, 
BstXI, EcoRI, BamHI, SacI, KpnI, and HindIII cloning site, N-terminal peptide purified 
with ProBond resin and cleaved with enterokinase; Invitrogen), to mention just two, can be 
employed according to the invention. 

Once a particular recombinant DNA molecule is identified and isolated, several 
methods known in the art may be used to propagate it. Once a suitable host system and 
growth conditions are established, recombinant expression vectors can be propagated and 
prepared in quantity. As previously explained, the expression vectors which can be used 
include, but are not limited to, the following vectors or their derivatives: human or animal 
viruses such as vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast 
vectors; bacteriophage vectors (e.g., lambda), and plasmid and cosmid DNA vectors, to 
name but a few. 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion 
desired. Different host cells have characteristic and specific mechanisms for the 
translational and post-translational processing and modification (e.g., glycosylation, 
cleavage, for example of the signal sequence) of proteins. Appropriate cell lines or host 
systems can be chosen to ensure the desired modification and processing of the foreign 
protein expressed. For example, expression in a bacterial system can be used to produce a 
nonglycosylated core protein product. However, the transmembrane ABCA5, ABCA6, 
ABCA9, and ABCA10 proteins expressed in bacteria may not be properly folded. 
Expression in yeast can produce a glycosylated product. Expression in eukaryotic cells can 
increase the likelihood of "native" glycosylation and folding of a heterologous protein. 
Moreover, expression in mammalian cells can provide a tool for reconstituting, or 
constituting, ABCA5, ABCA6, ABCA9, and ABCA10 activities. Furthermore, different 
vector/host expression systems may affect processing reactions, such as proteolytic 
cleavages, to a different extent. 

Vectors are introduced into the desired host cells by methods known in the art, e.g., 
transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, 
calcium phosphate precipitation, lipofection (liposome fusion), use of a gene gun, or a 
DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and 
Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut et al., Canadian Patent Application 
No. 2,012,311, filed March 15, 1990). 

A cell has been "transfected" by exogenous or heterologous DNA when such DNA 
has been introduced inside the cell. A cell has been "transformed" by exogenous or 
heterologous DNA when the transfected DNA effects a phenotypic change. Preferably, the 
transforming DNA should be integrated (covalently linked) into chromosomal DNA 
making up the genome of the cell. 

A recombinant marker protein expressed as an integral membrane protein can be 
isolated and purified by standard methods. Generally, the integral membrane protein can 
be obtained by lysing the membrane with detergents, such as but not limited to, sodium 
dodecyl sulfate (SDS), Triton X-100 polyoxyethylene ester, Igepal/nonidet P-40 (NP-40) 
(octylphenoxy)-polyethoxyethanol, digoxin, sodium deoxycholate, and the like, including 
mixtures thereof. Solubilization can be enhanced by sonication of the suspension. Soluble 
forms of the protein can be obtained by collecting culture fluid, or solubilizing inclusion 
bodies, e.g., by treatment with detergent, and if desired sonication or other mechanical 
processes, as described above. The solubilized or soluble protein can be isolated using 
various techniques, such as polyacrylamide gel electrophoresis (PAGE), isoelectric 
focusing, 2-dimensional gel electrophoresis, chromatography (e.g., ion exchange, affinity, 
immunoaffinity, and sizing column chromatography), centrifugation, differential solubility, 
immunoprecipitation, or by any other standard technique for the purification of proteins. 

Alternatively, a nucleic acid or vector according to the invention can be introduced 
in vivo by lipofection. For the past decade, there has been increasing use of liposomes for 
encapsulation and transfection of nucleic acids in vitro. Synthetic cationic lipids designed 
to limit the difficulties and dangers encountered with liposome mediated transfection can 
be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Felgner 
et al., 1987, PNAS 84:7413; Mackey et al., 1988, Proc. Natl. Acad. Sci. USA 
85:8027-8031; Ulmer et al., 1993, Science 259:1745-1748). The use of cationic lipids may 
promote encapsulation of negatively charged nucleic acids, and also promote fusion with 
negatively charged cell membranes (Felgner and Ringold, 1989, Science 337:387-388). 
Particularly useful lipid compounds and compositions for transfer of nucleic acids are 
described in International Patent Publications WO95/18863 and WO96/17823, and in U.S. 
Patent No. 5,459,127. The use of lipofection to introduce exogenous genes into 
specific organs in vivo has certain practical advantages. Molecular targeting of liposomes 
to specific cells represents one area of benefit. It is clear that directing transfection to 
particular cell types would be particularly preferred in a tissue with cellular heterogeneity, 
such as pancreas, liver, kidney, and the brain. Lipids may be chemically coupled to other 
molecules for the purpose of targeting (see Mackey et al., supra). Targeted peptides, e.g., 
hormones or neurotransmitters, and proteins such as antibodies, or non-peptide molecules 
could be coupled to liposomes chemically. 

Other molecules are also useful for facilitating transfection of a nucleic acid in vivo, 
such as a cationic oligopeptide (e.g., International Patent Publication WO95/21931), 
peptides derived from DNA binding proteins (e.g., International Patent Publication 
WO96/25508), or a cationic polymer (e.g., International Patent Publication WO95/21931). 

It is also possible to introduce the vector in vivo as a naked DNA plasmid (see U.S. 
Patents 5,693,622, 5,589,466 and 5,580,859). Naked DNA vectors for gene therapy can be 
introduced into the desired host cells by methods known in the art, e.g., transfection, 
electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium 
phosphate precipitation, use of a gene gun, or use of a DNA vector transporter (see, Wu et 
al., 1992, supra; Wu and Wu, 1988, supra; Hartmut et al., Canadian Patent Application 
No. 2,012,311, filed March 15, 1990; Williams et al., 1991, Proc. Natl. Acad. Sci. USA 
88:2726-2730). Receptor-mediated DNA delivery approaches can also be used (Curiel et 
al., 1992, Hum. Gene Ther. 3:147-154; Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432). 

"Pharmaceutically acceptable vehicle or excipient" includes diluents and fillers 
which are pharmaceutically acceptable for the method of administration, are sterile, and may 
be aqueous or oleaginous suspensions formulated using suitable dispersing or wetting 
agents and suspending agents. The particular pharmaceutically acceptable carrier and the 
ratio of active compound to carrier are determined by the solubility and chemical properties 
of the composition, the particular mode of administration, and standard pharmaceutical 
practice. 

Any nucleic acid, polypeptide, vector, or host cell of the invention will preferably 
be introduced in vivo in a pharmaceutically acceptable vehicle or excipient. The phrase 
"pharmaceutically acceptable" refers to molecular entities and compositions that are 
physiologically tolerable and do not typically produce an allergic or similar untoward 
reaction, such as gastric upset, dizziness and the like, when administered to a human. 
Preferably, as used herein, the term "pharmaceutically acceptable" means approved by a 
regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia 
or other generally recognized pharmacopeia for use in animals, and more particularly in 
humans. The term "excipient" refers to a diluent, adjuvant, excipient, or vehicle with 
which the compound is administered. Such pharmaceutical carriers can be sterile liquids, 
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, 
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water, aqueous 
saline solutions, and aqueous dextrose and glycerol solutions are preferably 
employed as excipients, particularly for injectable solutions. Suitable pharmaceutical 
excipients are described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

Naturally, the invention contemplates delivery of a vector that will express a 
therapeutically effective amount of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides for gene therapy applications. The phrase "therapeutically effective amount" 
is used herein to mean an amount sufficient to reduce by at least about 15 percent, 
preferably by at least 50 percent, more preferably by at least 90 percent, and still more 
preferably prevent, a clinically significant deficit in the activity, function and response of 
the host. Alternatively, a therapeutically effective amount is sufficient to cause an 
improvement in a clinically significant condition in the host. 

"Lipid profile" means the set of concentrations of cholesterol, triglyceride, 
lipoprotein cholesterol and other lipids in the body of a human or other animal. 

An "undesirable lipid profile" is the condition in which the concentrations of 
cholesterol, triglyceride, or lipoprotein cholesterol are outside of the age- and 
gender-adjusted reference ranges. Generally, a concentration of total cholesterol > 200 mg/dl, of 
plasma triglycerides > 200 mg/dl, of LDL cholesterol > 130 mg/dl, of HDL cholesterol < 
39 mg/dl, or a ratio of total cholesterol to HDL cholesterol > 4.0 is considered to be an 
undesirable lipid profile. An undesirable lipid profile is associated with a variety of 
pathological conditions, including hyperlipidaemias, diabetes, hypercholesterolaemia, 
arteriosclerosis, and other forms of coronary artery disease. 
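
The general thresholds given above can be written as a small screening function. The Python sketch below is illustrative only: the function name and the flag labels are hypothetical, all inputs are assumed to be in mg/dl, and the age- and gender-adjusted reference ranges mentioned in the definition are not modeled.

```python
def undesirable_lipid_flags(total_chol, triglycerides, ldl, hdl):
    """Return the criteria (as short labels) that mark a lipid profile as
    undesirable under the general thresholds quoted in the text.
    All concentrations are in mg/dl."""
    if hdl <= 0:
        raise ValueError("HDL cholesterol must be positive")
    flags = []
    if total_chol > 200:
        flags.append("total_cholesterol")
    if triglycerides > 200:
        flags.append("triglycerides")
    if ldl > 130:
        flags.append("ldl")
    if hdl < 39:
        flags.append("hdl")
    if total_chol / hdl > 4.0:
        flags.append("ratio")
    return flags
```

An empty list means that none of the general criteria is met; any non-empty list corresponds to an undesirable lipid profile as defined above.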

NUCLEIC ACIDS OF THE ABCA5, A6, A9, A10 GENES 

The applicants have identified a novel human ABCA-like cluster of genes, 
designated ABCA5, ABCA6, ABCA9, and ABCA10. The applicants have also 
determined that the new genes are closely spaced and arranged head-to-tail in the 
following order: ABCA5, ABCA10, ABCA6, and ABCA9 on the same region of 
chromosome 17q24, and encode full transporters (Figure 1). 

The applicants have also determined that each ABCA gene has a unique expression 
pattern, suggesting that the corresponding proteins may perform tissue-specialized 
functions (Example 3). 

In effect, expression patterns showed that the 6.5 kb ABCA5 transcript is almost 
ubiquitous, but the strongest expression was found in testis, skeletal muscle, fetal kidney 
and fetal liver. The ABCA9 transcript of 6 kb was found in heart, and a weak signal was 
detected in the ovary, small intestine and testis. The 7 kb ABCA6 transcript was 
detected with a strong signal in the liver and fetal liver; a weaker signal was detected in 
heart, kidney (fetal and adult), lung (fetal and adult), colon, small intestine, ovary and 
testis. No signal was detected under the same conditions in the brain (fetal and adult), 
pancreas, placenta, skeletal muscle, leukocyte, spleen, or thymus. The ABCA10 
transcript of 6.5 kb was strongly and specifically detected in skeletal muscle and heart 
(Figure 4). 

Also, in situ hybridization showed the strongest ABCA9 gene expression in endothelial 
cells, vascular smooth muscle, and Schwann cells, while the strongest ABCA10 gene 
expression was identified consistently in macrophages, subsets of lymphocytes, and 
Schwann cells of nerves. 

The applicants have further determined transcript sequences that should correspond 
to the full coding sequence (CDS) of ABCA5, ABCA6, ABCA9, and ABCA10, and 
that the ABCA6 and ABCA9 genes according to the invention comprise 39 exons and 38 
introns, and the ABCA10 gene according to the invention comprises at least 40 exons and 
39 introns. Table 1 hereinafter presents splice donor and acceptor scores (Ri, bits) which are 
consistent with those of exons in other mammalian genes (Rogan et al., Hum Mutat (1998) 
12, 153-171). Exons are located in exactly the same positions in all genes, although the 
length of some of the exons varies. Furthermore, there is a high correlation coefficient 
(0.990-0.997) for exon size between these genes and significant correlations (0.27-0.64) for 
some of the comparisons of intron sizes and Ri values, clearly suggesting that the genes 
from the 17q24 cluster arose by duplication from a common ancestor. 

Table 1: Correlations of exon size, intron size and Ri values.

                 Exons    Introns   Ri-SD   Ri-SA
ABCA6-ABCA9      0.997    0.62      0.45    0.28
ABCA6-ABCA10     0.990    0.46      0.64    0.53
ABCA9-ABCA10     0.995    0.43      0.47    0.46
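
By way of illustration only, a correlation coefficient of the kind reported in Table 1 may be computed as sketched below. The sketch assumes that a Pearson coefficient is meant (the text does not name the statistic), and the exon sizes used are hypothetical values chosen for the example, not data from the invention.

```python
from math import sqrt


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical sizes (bp) of corresponding exons in two genes of a cluster.
exon_sizes_gene_a = [120, 95, 210, 143, 178]
exon_sizes_gene_b = [118, 95, 204, 150, 178]

r = pearson(exon_sizes_gene_a, exon_sizes_gene_b)
print(f"exon-size correlation: {r:.3f}")
```

The exons are paired by position, which relies on the observation above that exons occupy the same positions in all genes of the cluster.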



The applicants have thus characterized new exonic sequences of the human ABCA6,
ABCA9, and ABCA10 genes, which are particularly useful according to the invention for
the production of various means of detection of the corresponding ABCA6, ABCA9, and
ABCA10 genes, or of their nucleotide expression products, in a sample.

Several exons of the ABCA6 gene have been characterized by their nucleotide
sequence and are identified in Table 2.

30 
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Table 2 : Human ABCA6 exons and intron DNA 
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186090 


4672 


4927 


56 


186091 


186225 


135 


39 


186226 


186594 


4928 


5296 


369 


186595 







Thus the present invention also relates to a nucleic acid comprising any one of SEQ
ID NOs: 9-47, or a complementary sequence.

The invention also relates to a nucleic acid comprising a nucleotide sequence as
depicted in any one of SEQ ID NOs: 9-47, or a complementary nucleotide sequence.

The invention also relates to a nucleic acid comprising at least 8 consecutive
nucleotides of any one of SEQ ID NOs: 9-47, or a complementary nucleotide sequence.

The subject of the invention is, in addition, a nucleic acid having at least 80%
nucleotide identity with a nucleic acid comprising any one of SEQ ID NOs: 9-47, or a
complementary nucleotide sequence.

The invention also relates to a nucleic acid having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% nucleotide identity with a nucleic acid
comprising any one of SEQ ID NOs: 9-47.

The invention also relates to a nucleic acid hybridizing, under high stringency
conditions, with a nucleic acid comprising any one of SEQ ID NOs: 9-47, or a
complementary nucleotide sequence.
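
As a non-limiting illustration of the identity thresholds recited above (80%, 85%, 90%, 95%, 98%), the following sketch computes a percentage of identity between two sequences that have already been aligned. It assumes that identity is counted as identical positions divided by the alignment length, with gap positions counting as mismatches; the function name, the gap symbol and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over two pre-aligned sequences of equal length.

    Assumption: the denominator is the full alignment length, so a
    position holding a gap in either sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    a = aligned_a.upper()
    b = aligned_b.upper()
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Hypothetical aligned fragments differing at a single position.
reference = "ACGTACGTAC"
candidate = "ACGTTCGTAC"

identity = percent_identity(reference, candidate)
print(f"{identity:.1f}% identity; at least 80%: {identity >= 80.0}")
```

Other conventions for the denominator (for example, the length of the shorter sequence) would give different percentages for gapped alignments.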

Several exons of the ABCA9 gene have been characterized by their nucleotide 
sequence and are identified in Table 3. 

Table 3: Human ABCA9 exons and intron DNA

Thus the present invention also relates to a nucleic acid comprising any one of SEQ
ID NOs: 48-86, or a complementary sequence.

The invention also relates to a nucleic acid comprising a nucleotide sequence as
depicted in any one of SEQ ID NOs: 48-86, or a complementary nucleotide sequence.

The invention also relates to a nucleic acid comprising at least 8 consecutive
nucleotides of any one of SEQ ID NOs: 48-86, or a complementary nucleotide sequence.

The subject of the invention is, in addition, a nucleic acid having at least 80%
nucleotide identity with a nucleic acid comprising any one of SEQ ID NOs: 48-86, or a
complementary nucleotide sequence.

The invention also relates to a nucleic acid having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% nucleotide identity with a nucleic acid
comprising any one of SEQ ID NOs: 48-86.

The invention also relates to a nucleic acid hybridizing, under high stringency
conditions, with a nucleic acid comprising any one of SEQ ID NOs: 48-86, or a
complementary nucleotide sequence.

Several exons of the ABCA10 gene have been characterized by their nucleotide
sequence and are identified in Table 4.

Table 4: Human ABCA10 exons and intron DNA

Thus, the invention also relates to a nucleic acid comprising any one of
SEQ ID NOs: 87-126, or a complementary nucleotide sequence thereof.

The invention also relates to a nucleic acid comprising a nucleotide sequence as
depicted in any one of SEQ ID NOs: 87-126, or a complementary nucleotide sequence
thereof.

The invention also relates to a nucleic acid comprising at least 8 consecutive
nucleotides of any one of SEQ ID NOs: 87-126, or a complementary nucleotide sequence.

The subject of the invention is, in addition, a nucleic acid having at least 80%
nucleotide identity with a nucleic acid comprising any one of SEQ ID NOs: 87-126, or a
complementary nucleotide sequence.

The invention also relates to a nucleic acid having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% nucleotide identity with a nucleic acid
comprising any one of SEQ ID NOs: 87-126, or a complementary nucleotide sequence.

The invention also relates to a nucleic acid hybridizing, under high stringency
conditions, with a nucleic acid comprising any one of SEQ ID NOs: 87-126, or a
complementary nucleotide sequence.

cDNA MOLECULES ENCODING FULL LENGTH ABCA5, ABCA6, ABCA9,
AND ABCA10 PROTEINS

The applicants have further determined the cDNA sequences and the full coding
sequences (CDS) corresponding to the human ABCA5, A6, A9, and A10 genes, which
belong to the same chromosome 17 cluster and encode the corresponding full length human
proteins (Example 2).

Table 5 summarizes, for each gene, the mRNA length, the coding nucleotide
sequence length and the protein size.

Table 5: Characterization of the four ABCA genes on the chromosome 17 cluster

          mRNA length   CDS     Polyadenylation          Protein   Number of
          (bp)          (bp)    site                     (AA)      coding exons
ABCA5                   4929
ABCA6                           AATAAA (position 5284)   1617      38
ABCA9                   4875                             1624      38
ABCA10                  4632                             1543      37

Nd: not determined; the genomic sequence is in progress (working draft BAC).



The cDNA sequence ABCA5 consists of 6525 nucleotides and contains a 4929
nucleotide coding sequence corresponding to a 1642 amino acid (aa) ABCA5 polypeptide
produced in subjects not affected by disorders associated with cholesterol reverse transport
or inflammatory lipid mediators transport. The cDNA molecule of the novel human
ABCA5 gene having the nucleotide sequence as set forth in SEQ ID NO: 1 comprises an
open reading frame beginning from the nucleotide at position 1011 (base A of the ATG
codon for initiation of translation) to the nucleotide at position 5939 (base A of the TGA
stop codon).

According to the invention, the ABCA5 cDNA comprising SEQ ID NO: 1 encodes
a full length ABCA5 polypeptide of 1642 amino acids comprising the amino acid sequence
of SEQ ID NO: 5.

The cDNA molecule of the novel human ABCA6 gene having the nucleotide
sequence as set forth in SEQ ID NO: 2 comprises an open reading frame beginning from
the nucleotide at position 176 (base A of the ATG codon for initiation of translation) to the
nucleotide at position 5029 (second base A of the TAA stop codon). A polyadenylation
signal (having the sequence AATAAA) is present, starting from the nucleotide at
position 5284 of the sequence SEQ ID NO: 2.
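
Purely as an illustration of how the start position of a polyadenylation signal such as AATAAA may be located in a cDNA sequence, a minimal sketch follows. Positions are reported 1-based, which is assumed to match the numbering used for the sequence listing; the function name and the toy sequence are hypothetical and do not reproduce SEQ ID NO: 2.

```python
def find_signal(sequence, signal="AATAAA"):
    """Return the 1-based start positions of every occurrence of `signal`.

    Overlapping occurrences are reported as well, because the search
    resumes one nucleotide after each hit.
    """
    sequence = sequence.upper()
    signal = signal.upper()
    positions = []
    index = sequence.find(signal)
    while index != -1:
        positions.append(index + 1)  # convert 0-based index to 1-based position
        index = sequence.find(signal, index + 1)
    return positions


# Hypothetical 3' untranslated fragment containing two copies of the signal.
toy_utr = "GGCTAATAAAGCTTGCAATAAATT"
print(find_signal(toy_utr))
```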

According to the invention, the ABCA6 cDNA comprising SEQ ID NO: 2 consists
of 5296 nucleotides and contains a 4854 nucleotide coding sequence which encodes a full
length ABCA6 polypeptide of 1617 amino acids comprising the amino acid sequence of
SEQ ID NO: 6.

The cDNA molecule of the novel human ABCA9 gene having the nucleotide
sequence as set forth in SEQ ID NO: 3 comprises a coding sequence beginning from the
nucleotide at position 144 (base A of the ATG codon for initiation of translation) to the
nucleotide at position 5018 (second base A of the TAA stop codon).

According to the invention, the ABCA9 cDNA comprising SEQ ID NO: 3 consists
of 5959 nucleotides and contains a 4875 nucleotide coding sequence which encodes a full
length ABCA9 polypeptide of 1624 amino acids comprising the amino acid sequence of
SEQ ID NO: 7.

The cDNA molecule of the novel human ABCA10 gene having the nucleotide
sequence as set forth in SEQ ID NO: 4 comprises a coding sequence beginning from the
nucleotide at position 880 (base A of the ATG codon for initiation of translation) to the
nucleotide at position 5511 (second base A of the TAA stop codon).

According to the invention, the ABCA10 cDNA comprising SEQ ID NO: 4 consists
of 6181 nucleotides and contains a 4632 nucleotide coding sequence which encodes a full
length ABCA10 polypeptide of 1543 amino acids comprising the amino acid sequence of
SEQ ID NO: 8.

The present invention is thus directed to a nucleic acid comprising SEQ ID NOs: 1-
4, or a complementary nucleotide sequence thereof.
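
The open reading frame coordinates stated above for SEQ ID NOs: 1-4 can be cross-checked against the stated coding sequence and polypeptide lengths with simple arithmetic, as sketched below. The sketch assumes that both stated positions are inclusive and 1-based and that the stated end position is the last base of the stop codon; the dictionary and function names are illustrative only.

```python
# (first base of ATG, last base of stop codon, stated polypeptide length in aa),
# as stated in the description for SEQ ID NOs: 1-4.
ORFS = {
    "ABCA5": (1011, 5939, 1642),
    "ABCA6": (176, 5029, 1617),
    "ABCA9": (144, 5018, 1624),
    "ABCA10": (880, 5511, 1543),
}


def cds_length(start, end):
    """Length in nucleotides of an inclusive, 1-based interval."""
    return end - start + 1


def protein_length(cds_nt):
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_nt // 3 - 1  # the stop codon encodes no amino acid


for gene, (start, end, stated_aa) in ORFS.items():
    nt = cds_length(start, end)
    aa = protein_length(nt)
    print(f"{gene}: CDS {nt} nt, {aa} aa, consistent with text: {aa == stated_aa}")
```

Under these assumptions the four open reading frames give 4929, 4854, 4875 and 4632 nucleotides respectively, in agreement with the coding sequence lengths recited above.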

The invention also relates to a nucleic acid comprising a nucleotide sequence as
depicted in SEQ ID NOs: 1-4, or a complementary nucleotide sequence thereof.

The invention also relates to a nucleic acid comprising at least eight consecutive
nucleotides of SEQ ID NOs: 1-4, or a complementary nucleotide sequence thereof.

The subject of the invention is also a nucleic acid having at least 80% nucleotide
identity with a nucleic acid comprising nucleotides of SEQ ID NOs: 1-4, or a nucleic acid
having a complementary nucleotide sequence thereof.

The invention also relates to a nucleic acid having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% nucleotide identity with a nucleic acid
comprising nucleotides of SEQ ID NOs: 1-4, or a nucleic acid having a complementary
nucleotide sequence thereof.

Another subject of the invention is a nucleic acid hybridizing, under high stringency
conditions, with a nucleic acid comprising nucleotides of SEQ ID NOs: 1-4, or a nucleic
acid having a complementary nucleotide sequence thereof.

The invention also relates to a nucleic acid encoding a polypeptide comprising an
amino acid sequence of SEQ ID NOs: 5-8.

The invention relates to a nucleic acid encoding a polypeptide comprising an amino
acid sequence as depicted in SEQ ID NOs: 5-8.

The invention also relates to a polypeptide comprising an amino acid sequence of SEQ
ID NOs: 5-8.

The invention also relates to a polypeptide comprising an amino acid sequence as
depicted in SEQ ID NOs: 5-8.

The invention also relates to a polypeptide comprising an amino acid sequence
having at least 80% amino acid identity with a polypeptide comprising an amino acid
sequence of SEQ ID NOs: 5-8, or a peptide fragment thereof.

The invention also relates to a polypeptide having at least 85%, preferably 90%,
more preferably 95% and still more preferably 98% amino acid identity with a polypeptide
comprising an amino acid sequence of SEQ ID NOs: 5-8.

Preferably, a polypeptide according to the invention will have a length of 4, 5 to 10,
15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100 or 200 consecutive amino acids of a polypeptide
according to the invention comprising an amino acid sequence of SEQ ID NOs: 5-8.

Like the ABCA1 and ABCA4 transporters, which present 45 to 66% amino acid
sequence identity, the ABCA5, ABCA6, ABCA9, and ABCA10 proteins also demonstrate
high conservation, as set forth in Tables 6-10 (Figure 2). Alignment of the amino acid
sequences of the ABCA5, ABCA6, ABCA9 and ABCA10 proteins reveals an identity ranging
from 43 to 62% along the entire sequence (Table 6). In particular, the ABCA5, ABCA6,
ABCA9, and ABCA10 proteins show 32 to 60% and 34 to 48% identity in the N-terminal
(Table 7) and C-terminal (Table 8) transmembrane domains (TMN and TMC),
respectively, and 56 to 77% identity in the ATP-binding domains (NBD1 and NBD2;
Tables 9 and 10).

Table 6: Homology / Identity percentages between the amino acid sequences of
ABCA5, ABCA6, ABCA8, ABCA9, ABCA10, and ABCA1 along the entire sequence

Total sequence   ABCA5       ABCA6       ABCA8       ABCA9       ABCA10      ABCA1
ABCA5            100/100
ABCA6            52.9/42.8   100/100
ABCA8            52.4/42.4   67/59.7     100/100
ABCA9            52.6/42.7   67.4/59.4   78.2/71.6   100/100
ABCA10           53.2/43.4   69.5/62.3   68.1/61.1   70.3/62.1   100/100
ABCA1            41.5/30.8   42.8/31     42.8/32     41.1/30.9   41.2/30.6   100/100
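
For illustration, the overall identity range of 43 to 62% stated in the description for the four proteins of the cluster can be recovered from the identity values of Table 6 (the second figure of each homology/identity pair), as sketched below. Only the pairwise values among ABCA5, ABCA6, ABCA9 and ABCA10 are used; the variable and function names are illustrative.

```python
# Identity percentages (second value of each pair) taken from Table 6,
# restricted to the four genes of the 17q24 cluster described here.
IDENTITY = {
    ("ABCA5", "ABCA6"): 42.8,
    ("ABCA5", "ABCA9"): 42.7,
    ("ABCA5", "ABCA10"): 43.4,
    ("ABCA6", "ABCA9"): 59.4,
    ("ABCA6", "ABCA10"): 62.3,
    ("ABCA9", "ABCA10"): 62.1,
}


def identity_range(table):
    """Return the (lowest, highest) pairwise identity in the table."""
    values = list(table.values())
    return min(values), max(values)


low, high = identity_range(IDENTITY)
print(f"identity ranges from {low:.1f}% to {high:.1f}%")
```

Rounded to whole percentages, the lowest and highest values correspond to the 43 to 62% range quoted in the text.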



Table 7: Homology / Identity percentages between the amino acid sequences of
ABCA5, ABCA6, ABCA9, ABCA10, ABCA1, and ABCA8 in the N-terminal
transmembrane domain

TMN domain       ABCA5       ABCA6       ABCA8       ABCA9       ABCA10      ABCA1
ABCA5            100/100
ABCA6            47/34.2     100/100
ABCA8            46.5/35     70.2/59.1   100/100
ABCA9            46.3/37.8   64.2/55.7   76.9/68.5   100/100
ABCA10           43.4/32.3   68.5/60.4   70.7/60.8   65.5/57.9   100/100
ABCA1            36.5/23.1   34.2/20     39.8/27.6   40.6/27.9   35.4/24.4   100/100



Table 8: Homology / Identity percentages between the amino acid sequences of
ABCA5, ABCA6, ABCA9, ABCA10, ABCA1, and ABCA8 in the C-terminal
transmembrane domain

TMC domain       ABCA5       ABCA6       ABCA8       ABCA9       ABCA10      ABCA1
ABCA5            100/100
ABCA6            43.7/33.7   100/100
ABCA8            48.2/31.8   53.8/44.2   100/100
ABCA9            47.3/33.7   57.2/48.2   64.1/52.9   100/100
ABCA10           47/35.4     57/47       54.3/43     57.4/44.4   100/100
ABCA1            33/21.6     32/21.4     39/24.8     35.3/26.8   34.7/22.4   100/100

Table 9: Homology / Identity percentages between the amino acid sequences of
ABCA5, ABCA6, ABCA9, ABCA10, ABCA1, and ABCA8 in the Nucleotide
Binding Domain 1 (NBD1)

NBD1 domain      ABCA5       ABCA6       ABCA8       ABCA9       ABCA10      ABCA1
ABCA5            100/100
ABCA6            70.2/60.3   100/100
ABCA8            71.8/62.5   85.4/78.6   100/100
ABCA9            65.5/58.2   80.2/72.8   88.5/81.8   100/100
ABCA10           69.8/62.1   83.2/77.2   82.8/79.2   81.9/75.8   100/100
ABCA1            56.8/48.5   53.5/43.5   61.2/50.5   51.7/43.8   56.5/45.6   100/100



Table 10: Homology / Identity percentages between the amino acid sequences of
ABCA5, ABCA6, ABCA9, ABCA10, ABCA1, and ABCA8 in the Nucleotide
Binding Domain 2 (NBD2)

NBD2 domain      ABCA5       ABCA6       ABCA8       ABCA9       ABCA10      ABCA1
ABCA5            100/100
ABCA6            63/56.1     100/100
ABCA8            66.9/58.4   78/73.5     100/100
ABCA9            65.2/57.0   77.6/72.6   94.5/91.8   100/100
ABCA10           63.7/56.2   74.9/71.2   81/77.4     82.3/77.8   100/100
ABCA1            46.4/37.8   46.3/37.9   46.9/38     47.3/39     46.4/37.7   100/100



Phylogenetic analysis of the ATP-binding domains demonstrates that the N- and
C-terminal domains form separate branches (Figure 3). The C-terminal ATP-binding
domains of the 17q24 genes are more closely related to the C-terminal domains of the
other ABC1-like genes than to the N-terminal domains of the same proteins. Thus the
entire ABC1 subfamily appears to have arisen from a single ancestral full transporter
gene. However, the genes in the 17q24 cluster form a distinct group within the ABC1
subfamily.

NUCLEOTIDE PROBES AND PRIMERS 

Nucleotide probes and primers hybridizing with a nucleic acid (genomic DNA,
messenger RNA, cDNA) according to the invention also form part of the invention.

According to the invention, nucleic acid fragments derived from a polynucleotide
comprising any one of SEQ ID NOs: 1-4 and 9-126 or of a complementary nucleotide
sequence are useful for the detection of the presence of at least one copy of a nucleotide
sequence of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 genes or of a
fragment or of a variant (containing a mutation or a polymorphism) thereof in a sample.

The nucleotide probes or primers according to the invention comprise a nucleotide
sequence comprising any one of SEQ ID NOs: 1-4 and 9-126, or a complementary
nucleotide sequence.

The nucleotide probes or primers according to the invention comprise at least 8
consecutive nucleotides of a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-
126, or a complementary nucleotide sequence.
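
As a simplified illustration of the criterion of at least 8 consecutive nucleotides of a given sequence or of its complementary sequence, the following sketch tests whether a candidate oligonucleotide shares such a stretch with a target. It assumes that the complementary sequence is read as the reverse complement and that only the unambiguous bases A, C, G and T are present; the function names and the toy sequences are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def shares_fragment(probe, target, min_len=8):
    """True if `probe` contains at least `min_len` consecutive nucleotides
    of `target` or of the reverse complement of `target`."""
    probe = probe.upper()
    strands = (target.upper(), reverse_complement(target))
    for start in range(len(probe) - min_len + 1):
        window = probe[start:start + min_len]
        if any(window in strand for strand in strands):
            return True
    return False


# Hypothetical target and candidate oligonucleotides.
target = "ATGGCTAGCTAGGATCCGATCGTA"
sense_probe = target[5:15]
antisense_probe = reverse_complement(target[5:15])
unrelated_probe = "TTTTTTTTTT"

for name, probe in [("sense", sense_probe), ("antisense", antisense_probe),
                    ("unrelated", unrelated_probe)]:
    print(name, shares_fragment(probe, target))
```

An exact-match test of this kind says nothing about hybridization behaviour under given stringency conditions, which depends on the conditions defined elsewhere in the description.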

Preferably, nucleotide probes or primers according to the invention have a length of
10, 12, 15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100, 200, 500, 1000, 1500 consecutive
nucleotides of a nucleic acid according to the invention, in particular of a nucleic acid
comprising any one of SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide
sequence.

Alternatively, a nucleotide probe or primer according to the invention consists of
and/or comprises fragments having a length of 12, 15, 18, 20, 25, 35, 40, 50, 100, 200,
500, 1000, 1500 consecutive nucleotides of a nucleic acid according to the invention, more
particularly of a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence.

The definition of a nucleotide probe or primer according to the invention therefore
covers oligonucleotides which hybridize, under the high stringency hybridization
conditions defined above, with a nucleic acid comprising any one of SEQ ID NOs: 1-4 and
9-126, or a complementary nucleotide sequence.

According to a preferred embodiment, a nucleotide primer according to the
invention comprises a nucleotide sequence of any one of SEQ ID NOs: 127-144, or a
complementary nucleic acid sequence.

Examples of primers and pairs of primers which make it possible to amplify various
regions of the ABCA5 gene are presented in Table 11 below. The location of each primer
of SEQ ID NOs: 127-144 within SEQ ID NO: 1, and its hybridizing region, is indicated in
Table 11. The abbreviation "Comp" refers to the complementary nucleic acid sequence.



Table 11: Primers for the amplification of nucleic fragments of the ABCA5 gene

Primer         Located in     Position in
SEQ ID NO:     SEQ ID NO:     the sequence
127            1              3842-3860
128            1              Comp 4858-4876
129            1              Comp 4783-4801
130            1              Comp 5789-5807
131            1              5630-5648
132            1              4858-4876
133            1              Comp 3998-4016
134            1              Comp 2987-3005
135            1              Comp 3186-3208
136            1              2528-2547
137            1              Comp 3088-3107
138            1              Comp 2528-2547
139            1              Comp 845-862
140            1              789-807
141                           Comp 1614-1633
142                           1614-1633
143                           Comp 537-566
144                           Comp 202-231



According to a preferred embodiment, a nucleotide primer according to the
invention comprises a nucleotide sequence of any one of SEQ ID NOs: 145-172, or a
complementary nucleic acid sequence.

Examples of primers and pairs of primers which make it possible to amplify various
regions of the ABCA6 gene are presented in Table 12 below. The location of each primer
of SEQ ID NOs: 145-172 within SEQ ID NO: 2, and its hybridizing region, is indicated in
Table 12. The abbreviation "Comp" refers to the complementary nucleic acid sequence.



Table 12: Primers for the amplification of nucleic fragments of the ABCA6 gene

Primer         Located in     Position in        Region
SEQ ID NO:     SEQ ID NO:     the sequence       for hybridization
145            2              202-221            Exon 2
146            2              Comp 435-461       Exon 3
147            2              Comp 645-672       Exon 5
148            2              637-656            Exon 5
149            2              [illegible]        Exon 6
150            2              Comp 758-778       Exon 6
151            2              Comp 773-792       Exon 6
152            2              1288-1307          Exon 8-9
153            2              1321-1341          Exon 9
154            2              Comp 1322-1343     Exon 9
155            2              Comp 1592-1574     Exon 10
156            2              1761-1782          Exon 12
157            2              Comp 1928-1949     Exon 13
158            2              1944-1968          Exon 13-14
159            2              Comp 2041-2061     Exon 14
160            2              Comp 2371-2392     Exon 17
161            2              2350-2371          Exon 17
162            2              2806-2884          Exon 20
163            2              Comp 2884-2902     Exon 20
164            2              3292-3313          Exon 23-24
165            2              Comp 3357-3339     Exon 24
166            2              Comp 3746-3767     Exon 27
167            2              3754-3775          Exon 27
168            2              4176-4194          Exon 31
169            2              Comp 4248-4194     Exon 31-32
170            2              4743-4763          Exon 36
171            2              Comp 4796-4778     Exon 36-37
172            2              Comp 5262-5244     Exon 39



According to a preferred embodiment, a nucleotide primer according to the
invention comprises a nucleotide sequence of any one of SEQ ID NOs: 173-203, or a
complementary nucleic acid sequence.

Examples of primers and pairs of primers which make it possible to amplify various
regions of the ABCA9 gene are presented in Table 13 below. The location of each primer
of SEQ ID NOs: 173-203 within SEQ ID NO: 3, and its hybridizing region, is indicated in
Table 13. The abbreviation "Comp" refers to the complementary nucleic acid sequence.

Table 13: Primers for the amplification of nucleic fragments of the ABCA9 gene

Primer         Located in     Position in        Region
SEQ ID NO:     SEQ ID NO:     the sequence       for hybridization
173            3              160-178            Exon 2
174            3              Comp 789-808       Exon 6
175            3              786-804            Exon 6
176            3              Comp 1434-1455     Exon 10
177            3              1305-1323          Exon 9
178            3              Comp 1632-1653     Exon 11-12
179            3              1495-1516          Exon 10
180            3              1866-1887          Exon 13
181            3              Comp 1905-1923     Exon 13
182            3              Comp 2349-2368     Exon 17
183            3              2253-2272          Exon 17
184            3              Comp 2822-2843     Exon 20
185            3              2645-2663          Exon 19
186            3              Comp 3089-3110     Exon 22
187            3              3240-3260          Exon 23
188            3              3023-3044          Exon 21
189            3              Comp 3801-3820     Exon 28
190            3              Comp 3377-3398     Exon 24
191            3              3626-3646          Exon 26
192            3              Comp 4191-4209     Exon 32
193            3              3964-3984          Exon 29-30
194            3              Comp 4784-4803     Exon 37
195            3              5230-5247          Exon 39
196            3              4694-4715          Exon 36
197            3              Comp 4977-4994     Exon 39
198            3              5541-5561          Exon 39
199            3              Comp 5960-5981     Exon 39
200            3              Comp 5541-5562     Exon 39
201            3              24-45              Exon 1
202            3              Comp 384-408       Exon 3
203            3              Comp 311-337       Exon 3

According to a preferred embodiment, a nucleotide primer according to the
invention comprises a nucleotide sequence of any one of SEQ ID NOs: 204-217, or a
complementary nucleic acid sequence thereof.

Examples of primers and pairs of primers which make it possible to amplify various
regions of the ABCA10 gene are presented in Table 14 below. The location of each primer
of SEQ ID NOs: 204-217 within SEQ ID NO: 4, and its hybridizing region, is indicated in
Table 14. The abbreviation "Comp" refers to the complementary nucleic acid sequence.



Table 14: Primers for the amplification of nucleic fragments of the ABCA10 gene

Primer         Located in     Position in        Region
SEQ ID NO:     SEQ ID NO:     the sequence       for hybridization
204            4              1421-1440          Exon 8
205            4              Comp 1610-1629     Exon 9
206            4              2417-2434          Exon 15
207            4              Comp 2605-2623     Exon 16
208            4              Comp 3737-3754     Exon 24
209            4              Comp 814-839       Exon 4
210            4              Comp 733-757       Exon 4
211            4              61-86              Exon 1
212            4              628-643            Exon 3
213            4              3564-3583          Exon 23
214            4              Comp 4450-4468     Exon 30-31
215            4              Comp 5442-5459     Exon 40
216            4              3050-3070          Exon 20
217            4              Comp 4848-4866     Exon 34
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According to a specific embodiment of preferred probes and primers according to 
the invention, they comprise all or part of a nucleotide sequence comprising any one of 
SEQ ID NOs: 127-217, or a nucleic acid having a complementary nucleic acid sequence. 

A nucleotide primer or probe according to the invention may be prepared by any
suitable method well known to persons skilled in the art, including by cloning and action of
restriction enzymes or by direct chemical synthesis according to techniques such as the
phosphodiester method by Narang et al. (1979, Methods Enzymol, 68:90-98) or by
Brown et al. (1979, Methods Enzymol, 68:109-151), the diethylphosphoramidite method by
Beaucage et al. (1981, Tetrahedron Lett, 22:1859-1862) or the technique on a solid support
described in European patent No. EP 0,707,592.

Each of the nucleic acids according to the invention, including the oligonucleotide
probes and primers described above, may be labeled, if desired, by incorporating a marker
which can be detected by spectroscopic, photochemical, biochemical, immunochemical or
chemical means. For example, such markers may consist of radioactive isotopes (32P, 33P,
3H, 35S), fluorescent molecules (5-bromodeoxyuridine, fluorescein, acetylaminofluorene,
digoxigenin) or ligands such as biotin. The labeling of the probes is preferably carried out
by incorporating labeled molecules into the polynucleotides by primer extension, or
alternatively by addition to the 5' or 3' ends. Examples of nonradioactive labeling of
nucleic acid fragments are described in particular in French patent No. 78 109 75 or in the
articles by Urdea et al. (1988, Nucleic Acids Research, 11:4937-4957) or
Sanchez-Pescador et al. (1988, J. Clin. Microbiol., 26(10):1934-1938).

Preferably, the nucleotide probes and primers according to the invention may have
structural characteristics of a type that allows amplification of the signal, such as the probes
described by Urdea et al. (1991, Nucleic Acids Symp Ser., 24:197-200) or alternatively in
European patent No. EP-0,225,807 (CHIRON).

The oligonucleotide probes according to the invention may be used in particular in
Southern-type hybridizations with the genomic DNA or alternatively in northern-type
hybridizations with the corresponding messenger RNA when the expression of the
corresponding transcript is sought in a sample.

The probes and primers according to the invention may also be used for the
detection of products of PCR amplification or alternatively for the detection of mismatches.

Nucleotide probes or primers according to the invention may be immobilized on a
solid support. Such solid supports are well known to persons skilled in the art and comprise
surfaces of wells of microtiter plates, polystyrene beads, magnetic beads, nitrocellulose
bands or microparticles such as latex particles.

WO 02/46458 PCT/EP01/15401

Consequently, the present invention also relates to a method of detecting the
presence of a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-
4 and 9-126, or of a complementary nucleotide sequence, or a nucleic acid fragment or
variant of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide
sequence in a sample, said method comprising the steps of:

1) bringing one or more nucleotide probes or primers according to the invention
into contact with the sample to be tested;

2) detecting the complex which may have formed between the probe(s) and the
nucleic acid present in the sample.

According to a specific embodiment of the method of detection according to the
invention, the oligonucleotide probes and primers are immobilized on a support.

According to another aspect, the oligonucleotide probes and primers comprise a
detectable marker.

The invention relates, in addition, to a box or kit for detecting the presence of a
nucleic acid according to the invention in a sample, said box or kit comprising:

a) one or more nucleotide probe(s) or primer(s) as described above;

b) where appropriate, the reagents necessary for the hybridization reaction.

According to a first aspect, the detection box or kit is characterized in that the
probe(s) or primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in that the
oligonucleotide probes comprise a detectable marker.

According to a specific embodiment of the detection kit described above, such a kit
will comprise a plurality of oligonucleotide probes and/or primers in accordance with the
invention which may be used to detect a target nucleic acid of interest or alternatively to
detect mutations in the coding regions or the non-coding regions of the nucleic acids
according to the invention, more particularly of nucleic acids comprising any one of
SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence.

Thus, the probes according to the invention, immobilized on a support, may be
ordered into matrices such as "DNA chips". Such ordered matrices have in particular been
described in US patent No. 5,143,854, in published PCT applications WO 90/15070 and
WO 92/10092.

Support matrices on which oligonucleotide probes have been immobilized at a high
density are for example described in US patent No. 5,412,087 and in published PCT
application WO 95/11995.

The nucleotide primers according to the invention may be used to amplify any one
of the nucleic acids according to the invention, and more particularly a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a
complementary nucleotide sequence. Alternatively, the nucleotide primers according to the
invention may be used to amplify a nucleic acid fragment or variant of any one of
SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence.

In a particular embodiment, the nucleotide primers according to the invention may
be used to amplify a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or as
depicted in any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide
sequence.

Another subject of the invention relates to a method of amplifying a nucleic acid
according to the invention, and more particularly a nucleic acid a) comprising any one of
SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence, or b) as depicted in
any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence,
contained in a sample, said method comprising the steps of:

a) bringing the sample in which the presence of the target nucleic acid is suspected
into contact with a pair of nucleotide primers whose hybridization position is located
respectively on the 5' side and on the 3' side of the region of the target nucleic acid whose
amplification is sought, in the presence of the reagents necessary for the amplification
reaction; and

b) detecting the amplified nucleic acids.
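
By way of example only, the size of the fragment expected from a primer pair positioned on the 5' side and on the 3' side of a target region may be estimated from the primer coordinates, as sketched below. The sketch assumes 1-based inclusive positions on the same reference sequence, with the "Comp" primer serving as the reverse primer; the pairing of the primers of SEQ ID NOs: 127 and 128 (positions 3842-3860 and Comp 4858-4876 in SEQ ID NO: 1, Table 11) is an assumption made for illustration and is not stated in the description.

```python
def amplicon_length(forward, reverse):
    """Expected amplicon size for a forward/reverse primer pair.

    Each primer is a (start, end) tuple of 1-based inclusive positions on
    the same reference strand; `reverse` is the region to which the
    complementary ("Comp") primer corresponds.
    """
    fwd_start, fwd_end = forward
    rev_start, rev_end = reverse
    if fwd_start > fwd_end or rev_start > rev_end:
        raise ValueError("primer coordinates must be given as (start, end)")
    if rev_start <= fwd_start or rev_end <= fwd_end:
        raise ValueError("reverse primer must lie on the 3' side of the forward primer")
    return rev_end - fwd_start + 1


# Positions taken from Table 11; the pairing itself is hypothetical.
forward_127 = (3842, 3860)
reverse_128 = (4858, 4876)

print(amplicon_length(forward_127, reverse_128), "bp")
```

The result includes both primer sequences, since the amplified product extends from the first base of the forward primer to the last base of the region matched by the reverse primer.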

25 To carry out the amplification method as defined above, use will be preferably 

made of any of the nucleotide primers described above. 

The subject of the invention is, in addition, a box or kit for amplifying a nucleic
acid according to the invention, and more particularly a nucleic acid comprising any one of
SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence, or as depicted in
any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence, said
box or kit comprising:

a) a pair of nucleotide primers in accordance with the invention, whose
hybridization position is located respectively on the 5' side and 3' side of the target nucleic
acid whose amplification is sought; and optionally,

b) reagents necessary for the amplification reaction.

Such an amplification box or kit will preferably comprise at least one pair of
nucleotide primers as described above.

The subject of the invention is, in addition, a box or kit for amplifying all or part of
a nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or a complementary
nucleotide sequence, said box or kit comprising:

1) a pair of nucleotide primers in accordance with the invention, whose
hybridization position is located respectively on the 5' side and 3' side of the target nucleic
acid whose amplification is sought; and optionally,

2) reagents necessary for an amplification reaction.

Such an amplification box or kit will preferably comprise at least one pair of
nucleotide primers as described above.

The invention also relates to a box or kit for detecting the presence of a nucleic acid 
according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention; 

b) where appropriate, reagents necessary for a hybridization reaction. 

According to a first aspect, the detection box or kit is characterized in that the
nucleotide probe(s) and primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in that the
nucleotide probe(s) and primer(s) comprise a detectable marker.

According to a specific embodiment of the detection kit described above, such a kit
will comprise a plurality of oligonucleotide probes and/or primers in accordance with the
invention which may be used to detect target nucleic acids of interest or alternatively to
detect mutations in the coding regions or the non-coding regions of the nucleic acids
according to the invention. According to a preferred embodiment of the invention, the target
nucleic acid comprises a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or
of a complementary nucleic acid sequence. Alternatively, the target nucleic acid is a
nucleic acid fragment or variant of a nucleic acid comprising any one of SEQ ID NOs: 1-4
and 9-126, or of a complementary nucleotide sequence.

According to the present invention, a primer according to the invention comprises,
generally, all or part of any one of SEQ ID NOs: 127-217, or a complementary sequence.

The nucleotide primers according to the invention are particularly useful in methods
of genotyping subjects and/or of genotyping populations, in particular in the context of
studies of association between particular allele forms or particular forms of groups of
alleles (haplotypes) in subjects and the existence of a particular phenotype (character) in
these subjects, for example the predisposition of these subjects to develop diseases linked
to a deficiency of cholesterol reverse transport and inflammation signaling lipids, or
alternatively the predisposition of these subjects to develop a pathology whose candidate
chromosomal region is situated on chromosome 17, more precisely on the 17q arm and still
more precisely in the 17q24 locus.

RECOMBINANT VECTORS 

The invention also relates to a recombinant vector comprising a nucleic acid
according to the invention. "Vector" for the purposes of the present invention will be
understood to mean a circular or linear DNA or RNA molecule which is either in
single-stranded or double-stranded form.

Preferably, such a recombinant vector will comprise a nucleic acid chosen from the
following nucleic acids:

a) a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4
and 9-126, or of a complementary nucleotide sequence,

b) a nucleic acid comprising a nucleotide sequence as depicted in any one of SEQ
ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence,

c) a nucleic acid having at least eight consecutive nucleotides of a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a
complementary nucleotide sequence;

d) a nucleic acid having at least 80% nucleotide identity with a nucleic acid
comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence;

e) a nucleic acid having 85%, 90%, 95%, or 98% nucleotide identity with a nucleic
acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or a
complementary nucleotide sequence;

f) a nucleic acid hybridizing, under high stringency hybridization conditions, with a
nucleic acid comprising a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126,
or a complementary nucleotide sequence;

g) a nucleic acid encoding a polypeptide comprising an amino acid sequence of
SEQ ID NO: 5-8; and

h) a nucleic acid encoding a polypeptide comprising an amino acid sequence selected
from SEQ ID NO: 5-8.
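
By way of illustration only, the criteria of items c) to e) above can be sketched
computationally. The following Python sketch assumes that the two sequences compared for
identity have already been aligned to equal length by a separate alignment step, with "-"
marking gaps, and that gap positions count as mismatches; both conventions, the function
names and the sequences are assumptions made for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned sequences of equal length.

    Assumption: gap characters ('-') are counted as mismatches and the
    denominator is the full alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def shares_consecutive_run(candidate: str, reference: str, run_length: int = 8) -> bool:
    """True if `candidate` contains `run_length` consecutive nucleotides of `reference`."""
    candidate, reference = candidate.upper(), reference.upper()
    return any(
        candidate[i:i + run_length] in reference
        for i in range(len(candidate) - run_length + 1)
    )


# Made-up sequences, for illustration only.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")   # one mismatch in ten positions
has_run = shares_consecutive_run("GGACGTACGTGG", "TTACGTACGTTT")
print(identity, identity >= 80.0, has_run)
```

The thresholds of 80%, 85%, 90%, 95% or 98% would then be applied to the value returned by
`percent_identity`, and a comparison against the complementary sequence would be carried
out in the same way on the reverse complement of the reference.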

According to a first embodiment, a recombinant vector according to the invention is
used to amplify a nucleic acid inserted therein, following transformation or transfection of
a desired cellular host.

According to a second embodiment, a recombinant vector according to the
invention corresponds to an expression vector comprising, in addition to a nucleic acid in
accordance with the invention, a regulatory signal or nucleotide sequence that directs or
controls transcription and/or translation of the nucleic acid and its encoded mRNA.

According to a preferred embodiment, a recombinant vector according to the
invention will comprise in particular the following components:

(1) an element or signal for regulating the expression of the nucleic acid to be
inserted, such as a promoter and/or enhancer sequence;

(2) a nucleotide coding region comprised within the nucleic acid in accordance with
the invention to be inserted into such a vector, said coding region being placed in phase
with the regulatory element or signal described in (1); and

(3) an appropriate nucleic acid for initiation and termination of transcription of the
nucleotide coding region of the nucleic acid described in (2).
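
By way of illustration only, one simple check related to component (2) is to verify that
the coding region to be inserted reads as a single uninterrupted open reading frame. The
following Python sketch is one possible interpretation of such a check and is not described
in the present text; the function name and the sequences are hypothetical, and the standard
stop codons TAA, TAG and TGA are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, assumed here


def is_open_reading_frame(coding: str) -> bool:
    """Check that `coding` starts with ATG, ends with a stop codon,
    has a length that is a multiple of three, and has no internal stop."""
    coding = coding.upper()
    if len(coding) < 6 or len(coding) % 3 != 0:
        return False
    codons = [coding[i:i + 3] for i in range(0, len(coding), 3)]
    return (
        codons[0] == "ATG"
        and codons[-1] in STOP_CODONS
        and not any(codon in STOP_CODONS for codon in codons[1:-1])
    )


# Made-up sequences, for illustration only.
print(is_open_reading_frame("ATGGCTAAATAA"))      # ATG GCT AAA TAA
print(is_open_reading_frame("ATGGCTTAAAAATAA"))   # internal TAA interrupts the frame
```

A frame-shifted insert, for example one carrying an extra nucleotide from a cloning site,
would fail the length or internal stop test and could be flagged before cloning.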

In addition, the recombinant vectors according to the invention may include one or
more origins for replication in the cellular hosts in which their amplification or their
expression is sought, markers or selectable markers.

By way of example, the bacterial promoters may be the LacI or LacZ promoters, the
T3 or T7 bacteriophage RNA polymerase promoters, or the lambda phage PR or PL
promoters.

The promoters for eukaryotic cells will comprise the herpes simplex virus (HSV)
thymidine kinase promoter or alternatively the mouse metallothionein-L promoter.

Generally, for the choice of a suitable promoter, persons skilled in the art can
preferably refer to the book by Sambrook et al. (1989, Molecular cloning: a laboratory
manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York) cited
above or to the techniques described by Fuller et al. (1996, Immunology, In: Current
Protocols in Molecular Biology, Ausubel et al. (eds.)).

When the expression of the genomic sequence of any one of the ABCA5, ABCA6,
ABCA9, and ABCA10 genes is sought, use will preferably be made of vectors
capable of containing large insertion sequences. In a particular embodiment, bacteriophage
vectors such as the P1 bacteriophage vectors, for example the vector p158 or the vector
p158/neo8 described by Sternberg (1992, Trends Genet., 8:1-16; 1994, Mamm. Genome,
5:397-404), will be preferably used.

The preferred bacterial vectors according to the invention are for example the
vectors pBR322 (ATCC 37017) or alternatively vectors such as pAA223-3 (Pharmacia,
Uppsala, Sweden), and pGEM1 (Promega Biotech, Madison, WI, UNITED STATES).

There may also be cited other commercially available vectors such as the vectors
pQE70, pQE60, pQE9 (Qiagen), psiX174, pBluescript SA, pNH8A, pNH16A, pNH18A,
pNH46A, pWLNEO, pSV2CAT, pOG44, pXTI, pSG (Stratagene).

They may also be vectors of the baculovirus type such as the vector pVL1392/1393 
(Pharmingen) used to transfect cells of the Sf9 line (ATCC No. CRL 1711) derived from 
Spodoptera frugiperda. 

They may also be adenoviral vectors such as the human adenovirus of type 2 or 5.

A recombinant vector according to the invention may also be a retroviral vector or
an adeno-associated vector (AAV). Such adeno-associated vectors are for example
described by Flotte et al. (1992, Am. J. Respir. Cell Mol. Biol., 7:349-356), Samulski et al.
(1989, J. Virol., 63:3822-3828), or McLaughlin BA et al. (1996, Am. J. Hum. Genet., 59:561-569).

To allow the expression of a polynucleotide according to the invention, the latter
must be introduced into a host cell. The introduction of a polynucleotide according to the
invention into a host cell may be carried out in vitro, according to the techniques well
known to persons skilled in the art for transforming or transfecting cells, either in primary
culture, or in the form of cell lines. It is also possible to carry out the introduction of a
polynucleotide according to the invention in vivo or ex vivo, for the prevention or treatment
of diseases linked to ABC A5, A6, A9 or A10 deficiencies.

To introduce a polynucleotide or vector of the invention into a host cell, a person
skilled in the art can preferably refer to various techniques, such as the calcium phosphate
precipitation technique (Graham et al., 1973, Virology, 52:456-457; Chen et al., 1987,
Mol. Cell. Biol., 7:2745-2752), DEAE dextran (Gopal, 1985, Mol. Cell. Biol., 5:1188-1190),
electroporation (Tur-Kaspa, 1986, Mol. Cell. Biol., 6:716-718; Potter et al., 1984,
Proc. Natl. Acad. Sci. USA, 81(22):7161-5), direct microinjection (Harland et al., 1985, J.
Cell. Biol., 101:1094-1095), or liposomes charged with DNA (Nicolau et al., 1982, Methods
Enzymol., 149:157-76; Fraley et al., 1979, Proc. Natl. Acad. Sci. USA, 76:3348-3352).

Once the polynucleotide has been introduced into the host cell, it may be stably
integrated into the genome of the cell. The integration may be achieved at a precise site of
the genome, by homologous recombination, or it may be randomly integrated. In some
embodiments, the polynucleotide may be stably maintained in the host cell in the form of
an episome fragment, the episome comprising sequences allowing the retention and the
replication of the latter, either independently, or in a synchronized manner with the cell
cycle.

According to a specific embodiment, a method of introducing a polynucleotide
according to the invention into a host cell, in particular a host cell obtained from a
mammal, in vivo, comprises a step during which a preparation comprising a
pharmaceutically compatible vector and a "naked" polynucleotide according to the
invention, placed under the control of appropriate regulatory sequences, is introduced by
local injection at the level of the chosen tissue, for example myocardial tissue, the "naked"
polynucleotide being absorbed by the myocytes of this tissue.

Compositions for use in vitro and in vivo comprising "naked" polynucleotides are
for example described in PCT Application No. WO 95/11307 (Institut Pasteur, Inserm,
University of Ottawa) as well as in the articles by Tascon et al. (1996, Nature Medicine,
2(8):888-892) and Huygen et al. (1996, Nature Medicine, 2(8):893-898).

According to a specific embodiment of the invention, a composition is provided for
the in vivo production of any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins.
This composition comprises a polynucleotide encoding the ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides placed under the control of appropriate regulatory sequences, in
solution in a physiologically acceptable vector.

The quantity of vector which is injected into the host organism chosen varies
according to the site of the injection. As a guide, there may be injected between about 0.1
and about 100 µg of polynucleotide encoding the ABCA5, ABCA6, ABCA9, and ABCA10
proteins into the body of an animal, preferably into a patient likely to develop a disease
linked to ABC A5, A6, A9 or A10 deficiencies. Consequently, the invention also relates
to a pharmaceutical composition intended for the prevention of or treatment of a patient or
subject affected by ABC A5, A6, A9 or A10 deficiencies, comprising a nucleic acid
encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins, in combination
with one or more physiologically compatible excipients.

Preferably, such a composition will comprise a nucleic acid comprising a nucleotide
sequence of any one of SEQ ID NO: 1-4, wherein the nucleic acid is placed under the
control of an appropriate regulatory element or signal.

The subject of the invention is, in addition, a pharmaceutical composition intended
for the prevention of or treatment of a patient or a subject affected by ABC A5, A6, A9 or
A10 deficiencies, comprising a recombinant vector according to the invention, in
combination with one or more physiologically compatible excipients.

The invention also relates to the use of a nucleic acid according to the invention,
encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins, for the
manufacture of a medicament intended for the prevention of arteriosclerosis in various
forms or more particularly for the treatment of subjects affected by a dysfunction of
cholesterol reverse transport or inflammatory lipophilic substances transport.

The invention also relates to the use of a recombinant vector according to the
invention, comprising a nucleic acid encoding any one of the ABCA5, ABCA6, ABCA9,
and ABCA10 proteins, for the manufacture of a medicament intended for the prevention of
arteriosclerosis in various forms or more particularly for the treatment of subjects affected
by a dysfunction of cholesterol reverse transport or inflammatory lipophilic substances
transport.

The subject of the invention is therefore also a recombinant vector comprising a
nucleic acid according to the invention that encodes any one of ABCA5, ABCA6, ABCA9,
and ABCA10 proteins or polypeptides.

The invention also relates to the use of such a recombinant vector for the
preparation of a pharmaceutical composition intended for the treatment and/or for the
prevention of diseases or conditions associated with deficiency of cholesterol reverse
transport or inflammatory lipophilic substances transport.

The present invention also relates to the use of cells genetically modified ex vivo
with such a recombinant vector according to the invention, or of cells producing a
recombinant vector, wherein the cells are implanted in the body, to allow a prolonged and
effective expression in vivo of at least one of the biologically active ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides.

Vectors useful in methods of somatic gene therapy and compositions containing 
such vectors. 

The present invention also relates to a new therapeutic approach for the treatment of
pathologies linked to ABC A5, A6, A9 or A10 deficiencies. It provides an advantageous
solution to the disadvantages of the prior art, by demonstrating the possibility of treating
the pathologies linked to ABC A5, A6, A9 or A10 deficiencies by gene therapy, by the
transfer and expression in vivo of a gene encoding at least one of ABCA5, ABCA6, ABCA9,
and ABCA10 proteins involved in the transport of lipophilic substances. The invention thus
offers a simple means allowing a specific and effective treatment of related pathologies
such as, for example, arteriosclerosis, inflammation, cardiovascular diseases, metabolic
diseases, and lipophilic substances related pathologies.

Gene therapy consists in correcting a deficiency or an abnormality (mutation,
aberrant expression and the like) and in bringing about the expression of a protein of
therapeutic interest by introducing genetic information into the affected cell or organ. This
genetic information may be introduced either ex vivo into a cell extracted from the organ,
the modified cell then being reintroduced into the body, or directly in vivo into the
appropriate tissue. In this second case, various techniques exist, among which various
transfection techniques involving complexes of DNA and DEAE-dextran (Pagano et al.,
1967, J. Virol., 1:891), of DNA and nuclear proteins (Kaneda et al., 1989, Science
243:375), of DNA and lipids (Felgner et al., 1987, PNAS 84:7413), the use of liposomes
(Fraley et al., 1980, J. Biol. Chem., 255:10431), and the like. More recently, the use of viruses
as vectors for the transfer of genes has appeared as a promising alternative to these physical
transfection techniques. In this regard, various viruses have been tested for their capacity to
infect certain cell populations, in particular the retroviruses (RSV, HMS, MMS, and the
like), the HSV virus, the adeno-associated viruses and the adenoviruses.

The present invention therefore also relates to a new therapeutic approach for the
treatment of pathologies linked to ABC A5, A6, A9 or A10 deficiencies, consisting in
transferring and in expressing in vivo genes encoding any one of ABCA5, ABCA6,
ABCA9, and ABCA10. In a particularly preferred manner, the applicant has now found
that it is possible to construct recombinant vectors comprising a nucleic acid encoding at
least one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins, to administer these
recombinant vectors in vivo, and that this administration allows a stable and effective
expression of at least one of the biologically active ABCA5, ABCA6, ABCA9, and
ABCA10 proteins in vivo, with no cytopathological effect.

Adenoviruses constitute particularly efficient vectors for the transfer and the
expression of any one of the ABCA5, ABCA6, ABCA9, and ABCA10 genes. The use of
recombinant adenoviruses as vectors makes it possible to obtain sufficiently high levels of
expression of this gene to produce the desired therapeutic effect. Other viral vectors, such as
retroviruses or adeno-associated viruses (AAV), which can allow a stable expression of the
gene, are also claimed.

The present invention is thus likely to offer a new approach for the treatment and
prevention of ABC A5, A6, A9 or A10 deficiencies.

The subject of the invention is therefore also a defective recombinant virus
comprising a nucleic acid according to the invention that encodes at least one of the ABCA5,
ABCA6, ABCA9, and ABCA10 proteins or polypeptides involved in the metabolism of
lipophilic substances.

The invention also relates to the use of such a defective recombinant virus for the
preparation of a pharmaceutical composition which may be useful for the treatment and/or
for the prevention of ABC A5, A6, A9 or A10 deficiencies.

The present invention also relates to the use of cells genetically modified ex vivo
with such a defective recombinant virus according to the invention, or of cells producing a
defective recombinant virus, wherein the cells are implanted in the body, to allow a
prolonged and effective expression in vivo of at least one of the biologically active ABCA5,
ABCA6, ABCA9, and ABCA10 polypeptides.

The present invention is particularly advantageous because it makes it possible to
induce a controlled expression, and with no harmful effect, of ABCA5, ABCA6, ABCA9,
and ABCA10 in organs which are not normally involved in the expression of this protein.
In particular, a significant release of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins
is obtained by implantation of cells producing vectors of the invention, or infected ex vivo
with vectors of the invention.

The activity of these ABC protein transporters produced in the context of the
present invention may be of the human or animal ABCA5, ABCA6, ABCA9, and
ABCA10 type. The nucleic sequence used in the context of the present invention may be a
cDNA, a genomic DNA (gDNA), an RNA (in the case of retroviruses) or a hybrid
construct consisting, for example, of a cDNA into which one or more introns (gDNA)
would be inserted. It may also involve synthetic or semisynthetic sequences. In a
particularly advantageous manner, a cDNA or a gDNA is used. In particular, the use of a
gDNA allows a better expression in human cells. To allow their incorporation into a viral
vector according to the invention, these sequences are preferably modified, for example by
site-directed mutagenesis, in particular for the insertion of appropriate restriction sites. The
sequences described in the prior art are indeed not constructed for use according to the
invention, and prior adaptations may prove necessary, in order to obtain substantial
expressions. In the context of the present invention, the use of a nucleic sequence encoding
any one of human ABCA5, ABCA6, ABCA9, and ABCA10 proteins is preferred.
Moreover, it is also possible to use a construct encoding a derivative of any one of ABCA5,
ABCA6, ABCA9, and ABCA10 proteins. A derivative of any one of ABCA5, ABCA6,
ABCA9, and ABCA10 proteins comprises, for example, any sequence obtained by
mutation, deletion and/or addition relative to the native sequence, and encoding a product
retaining the lipophilic substances transport activity. These modifications may be made by
techniques known to a person skilled in the art (see general molecular biological techniques
below). The biological activity of the derivatives thus obtained can then be easily
determined, as indicated in particular in the examples of the measurement of the efflux of
the substrate from cells. The derivatives for the purposes of the invention may also be
obtained by hybridization from nucleic acid libraries, using as probe the native sequence or
a fragment thereof.

These derivatives are in particular molecules having a higher affinity for their
binding sites, molecules exhibiting greater resistance to proteases, molecules having a
higher therapeutic efficacy or fewer side effects, or optionally new biological properties.
The derivatives also include the modified DNA sequences allowing improved expression
in vivo.

In a first embodiment, the present invention relates to a defective recombinant
virus comprising a cDNA encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides. In another preferred embodiment of the invention, a defective recombinant
virus comprises a genomic DNA (gDNA) encoding any one of ABCA5, ABCA6, ABCA9,
and ABCA10 polypeptides. Preferably, the ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides comprise an amino acid sequence selected from SEQ ID NO: 5-8,
respectively.

The vectors of the invention may be prepared from various types of viruses.
Preferably, vectors derived from adenoviruses, adeno-associated viruses (AAV),
herpesviruses (HSV) or retroviruses are used. It is preferable to use an adenovirus, for
direct administration or for the ex vivo modification of cells intended to be implanted, or a
retrovirus, for the implantation of producing cells.

The viruses according to the invention are defective, that is to say that they are
incapable of autonomously replicating in the target cell. Generally, the genome of the
defective viruses used in the context of the present invention therefore lacks at least the
sequences necessary for the replication of said virus in the infected cell. These regions may
be either eliminated (completely or partially), or made non functional, or substituted with
other sequences and in particular with the nucleic sequence encoding any one of the
ABCA5, ABCA6, ABCA9, and ABCA10 proteins. Preferably, the defective virus retains,
nevertheless, the sequences of its genome which are necessary for the encapsidation of the
viral particles.

As regards more particularly adenoviruses, various serotypes, whose structure and
properties vary somewhat, have been characterized. Among these serotypes, human
adenoviruses of type 2 or 5 (Ad 2 or Ad 5) or adenoviruses of animal origin (see
Application WO 94/26914) are preferably used in the context of the present invention.
Among the adenoviruses of animal origin which can be used in the context of the present
invention, there may be mentioned adenoviruses of canine, bovine, murine (example:
Mav1, Beard et al., Virology 75 (1990) 81), ovine, porcine, avian or simian (example:
SAV) origin. Preferably, the adenovirus of animal origin is a canine adenovirus, more
preferably a CAV2 adenovirus [Manhattan or A26/61 strain (ATCC VR-800) for example].
Preferably, adenoviruses of human or canine or mixed origin are used in the context of the
invention. Preferably, the defective adenoviruses of the invention comprise the ITRs, a
sequence allowing the encapsidation and the sequence encoding any one of the ABCA5,
ABCA6, ABCA9, and ABCA10 proteins. Preferably, in the genome of the adenoviruses of
the invention, the E1 region at least is made non functional. Still more preferably, in the
genome of the adenoviruses of the invention, the E1 gene and at least one of the E2, E4 and
L1-L5 genes are non functional. The viral gene considered may be made non functional by
any technique known to a person skilled in the art, and in particular by total suppression, by
substitution, by partial deletion or by addition of one or more bases in the gene(s)
considered. Such modifications may be obtained in vitro (on the isolated DNA) or in situ,
for example, by means of genetic engineering techniques, or by treatment by means of
mutagenic agents. Other regions may also be modified, and in particular the E3
(WO95/02697), E2 (WO94/28938), E4 (WO94/28152, WO94/12649, WO95/02697) and
L5 (WO95/02697) regions. According to a preferred embodiment, the adenovirus according
to the invention comprises a deletion in the E1 and E4 regions and the sequence encoding
any one of ABCA5, ABCA6, ABCA9, and ABCA10 is inserted at the level of the
inactivated E1 region. According to another preferred embodiment, it comprises a deletion
in the E1 region at the level of which the E4 region and the sequence encoding any one of
ABCA5, ABCA6, ABCA9, and ABCA10 (French Patent Application FR94 13355) are
inserted.

The defective recombinant adenoviruses according to the invention may be
prepared by any technique known to persons skilled in the art (Levrero et al., 1991, Gene
101; EP 185 573; and Graham, 1984, EMBO J., 3:2917). In particular, they may be prepared
by homologous recombination between an adenovirus and a plasmid carrying, inter alia,
the nucleic acid encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10
proteins. The homologous recombination occurs after cotransfection of said adenoviruses
and plasmid into an appropriate cell line. The cell line used must preferably (i) be
transformable by said elements, and (ii) contain the sequences capable of complementing
the part of the defective adenovirus genome, preferably in integrated form in order to avoid
the risks of recombination. By way of example of a line, there may be mentioned the
human embryonic kidney line 293 (Graham et al., 1977, J. Gen. Virol., 36:59), which
contains in particular, integrated into its genome, the left part of the genome of an Ad5
adenovirus (12%) or lines capable of complementing the E1 and E4 functions as described
in particular in Applications No. WO 94/26914 and WO95/02697.

As regards the adeno-associated viruses (AAV), they are DNA viruses of a
relatively small size, which integrate into the genome of the cells which they infect, in a
stable and site-specific manner. They are capable of infecting a broad spectrum of cells,
without inducing any effect on cellular growth, morphology or differentiation. Moreover,
they do not appear to be involved in pathologies in humans. The genome of AAVs has
been cloned, sequenced and characterized. It comprises about 4700 bases, and contains at
each end an inverted repeat region (ITR) of about 145 bases, serving as replication origin
for the virus. The remainder of the genome is divided into 2 essential regions carrying the
encapsidation functions: the left hand part of the genome, which contains the rep gene,
involved in the viral replication and the expression of the viral genes; the right hand part of
the genome, which contains the cap gene encoding the virus capsid proteins.

The use of vectors derived from AAVs for the transfer of genes in vitro and in
vivo has been described in the literature (see in particular WO 91/18088; WO 93/09239;
US 4,797,368, US 5,139,941, EP 488 528). These applications describe various constructs
derived from AAVs, in which the rep and/or cap genes are deleted and replaced by a gene
of interest, and their use for transferring in vitro (on cells in culture) or in vivo (directly into
an organism) said gene of interest. However, none of these documents either describes or
suggests the use of a recombinant AAV for the transfer and expression in vivo or ex vivo of
any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins, or the advantages of such a
transfer. The defective recombinant AAVs according to the invention may be prepared by
cotransfection, into a cell line infected with a human helper virus (for example an
adenovirus), of a plasmid containing the sequence encoding any one of the ABCA5,
ABCA6, ABCA9, and ABCA10 proteins bordered by two AAV inverted repeat regions
(ITR), and of a plasmid carrying the AAV encapsidation genes (rep and cap genes). The
recombinant AAVs produced are then purified by conventional techniques.

As regards the herpesviruses and the retroviruses, the construction of recombinant
vectors has been widely described in the literature: see in particular Breakfield et al.
(1991, New Biologist, 3:203); EP 453242, EP 178220, Bernstein et al. (1985); McCormick
(1985, BioTechnology, 3:689), and the like.

In particular, the retroviruses are integrating viruses, infecting dividing cells. The
genome of the retroviruses essentially comprises two long terminal repeats (LTRs), an
encapsidation sequence and three coding regions (gag, pol and env). In the recombinant
vectors derived from retroviruses, the gag, pol and env genes are generally deleted,
completely or partially, and replaced with a heterologous nucleic acid sequence of interest.
These vectors may be produced from various types of retroviruses such as in particular
MoMuLV ("murine Moloney leukemia virus"; also called MoMLV), MSV ("murine
Moloney sarcoma virus"), HaSV ("Harvey sarcoma virus"), SNV ("spleen necrosis virus"),
RSV ("Rous sarcoma virus") or Friend's virus.

To construct recombinant retroviruses containing a sequence encoding any one of
the ABCA5, ABCA6, ABCA9, and ABCA10 proteins according to the invention, a
plasmid containing in particular the LTRs, the encapsidation sequence and said coding
sequence is generally constructed, and then used to transfect a so-called encapsidation cell
line, capable of providing in trans the retroviral functions deficient in the plasmid.
Generally, the encapsidation lines are therefore capable of expressing the gag, pol and env
genes. Such encapsidation lines have been described in the prior art, and in particular the
PA317 line (US 4,861,719), the PsiCRIP line (WO 90/02806) and the GP+envAm-12 line
(WO 89/07150). Moreover, the recombinant retroviruses may contain modifications at the
level of the LTRs in order to suppress the transcriptional activity, as well as extended
encapsidation sequences, containing a portion of the gag gene (Bender et al., 1987, J.
Virol., 61:1639). The recombinant retroviruses produced are then purified by conventional
techniques.

To carry out the present invention, it is preferable to use a defective recombinant
adenovirus. The properties of adenoviruses are particularly advantageous for the
in vivo expression of a protein having a lipophilic substrate transport activity. The
adenoviral vectors according to the invention are particularly preferred for a direct
administration in vivo of a purified suspension, or for the ex vivo transformation of cells, in
particular autologous cells, in view of their implantation. Furthermore, the adenoviral
vectors according to the invention exhibit considerable advantages, such as in
particular their very high infection efficiency, which makes it possible to carry out
infections using small volumes of viral suspension.

According to another particularly preferred embodiment of the invention, a line
producing retroviral vectors containing the sequence encoding any one of the ABCA5,
ABCA6, ABCA9, and ABCA10 proteins is used for implantation in vivo. The lines which
can be used to this end are in particular the PA317 (US 4,861,719), PsiCrip
(WO 90/02806) and GP+envAm-12 (US 5,278,056) cells modified so as to allow the
production of a retrovirus containing a nucleic sequence encoding any one of ABCA5,
ABCA6, ABCA9, and ABCA10 proteins according to the invention. For example,
totipotent stem cells, precursors of blood cell lines, may be collected and isolated from a
subject. These cells, when cultured, may then be transfected with the retroviral vector
containing the sequence encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
proteins under the control of viral promoters, nonviral promoters or nonviral promoters
specific for macrophages, or under the control of its own promoter. These cells are then
reintroduced into the subject. The differentiation of these cells will give rise to blood cells
expressing at least one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins.

Preferably, in the vectors of the invention, the sequence encoding any one of the
ABCA5, ABCA6, ABCA9, and ABCA10 proteins is placed under the control of signals
allowing its expression in the infected cells. These may be expression signals which are
homologous or heterologous, that is to say signals different from those which are naturally
responsible for the expression of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins.
They may also be in particular sequences responsible for the expression of other proteins,
or synthetic sequences. In particular, they may be sequences of eukaryotic or viral genes or
derived sequences, stimulating or repressing the transcription of a gene in a specific
manner or otherwise and in an inducible manner or otherwise. By way of example, they
may be promoter sequences derived from the genome of the cell which it is desired to
infect, or from the genome of a virus, and in particular the promoters of the E1A or major
late promoter (MLP) genes of adenoviruses, the cytomegalovirus (CMV) promoter, the
RSV-LTR and the like. Among the eukaryotic promoters, there may also be mentioned the
ubiquitous promoters (HPRT, vimentin, α-actin, tubulin and the like), the promoters of the
intermediate filaments (desmin, neurofilaments, keratin, GFAP, and the like), the
promoters of therapeutic genes (of the MDR, CFTR or factor VIII type, and the like),
tissue-specific promoters (pyruvate kinase, villin, promoter of the fatty acid binding
intestinal protein, promoter of the smooth muscle cell α-actin, promoters specific for the
liver: Apo AI, Apo AII, human albumin and the like) or promoters corresponding to a
stimulus (steroid hormone receptor, retinoic acid receptor and the like). In addition, these
expression sequences may be modified by addition of enhancer or regulatory sequences and
the like. Moreover, when the inserted gene does not contain expression sequences, it may
be inserted into the genome of the defective virus downstream of such a sequence.

In a specific embodiment, the invention relates to a defective recombinant virus
comprising a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10
proteins under the control of a promoter chosen from RSV-LTR or the CMV early promoter.

As indicated above, the present invention also relates to any use of a virus as
described above for the preparation of a pharmaceutical composition for the treatment
and/or prevention of pathologies linked to the transport of lipophilic substances.

The present invention also relates to a pharmaceutical composition comprising
one or more defective recombinant viruses as described above. These pharmaceutical
compositions may be formulated for administration by the topical, oral, parenteral,
intranasal, intravenous, intramuscular, subcutaneous, intraocular or transdermal route and
the like. Preferably, the pharmaceutical compositions of the invention comprise a
pharmaceutically acceptable vehicle or physiologically compatible excipient for an
injectable formulation, in particular for an intravenous injection, such as for example into
the patient's portal vein. These may relate in particular to isotonic sterile solutions or dry,
in particular freeze-dried, compositions which, upon addition, depending on the case, of
sterilized water or physiological saline, allow the preparation of injectable solutions. Direct
injection into the patient's portal vein is preferred because it makes it possible to target the
infection at the level of the liver and thus to concentrate the therapeutic effect at the level
of this organ.

The doses of defective recombinant virus used for the injection may be adjusted as
a function of various parameters, and in particular as a function of the viral vector, of the
mode of administration used, of the relevant pathology or of the desired duration of
treatment. In general, the recombinant adenoviruses according to the invention are
formulated and administered in the form of doses of between 10^ and 10^14 pfu/ml, and
preferably 10^ to 10^10 pfu/ml. The term "pfu" (plaque forming unit) corresponds to the
infectivity of a virus solution, and is determined by infecting an appropriate cell culture and
measuring, generally after 48 hours, the number of plaques that result from infected cell
lysis. The techniques for determining the pfu titer of a viral solution are well documented
in the literature.
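
By way of illustration only, the titer calculation underlying the definition of the pfu
may be sketched as follows. The function name, its parameters and the numerical values
are hypothetical and are not taken from the present description; the sketch merely
assumes the conventional plaque-assay relation, in which the titer equals the number of
plaques multiplied by the dilution factor and divided by the inoculum volume.

```python
def plaque_titer(plaque_count: int, dilution_factor: float, inoculum_ml: float) -> float:
    """Return the titer in pfu/ml from a single plaque-assay well.

    dilution_factor is the fold dilution of the stock (1e6 for a 10^-6 dilution);
    inoculum_ml is the volume of diluted virus applied to the cell culture.
    All names and values here are illustrative assumptions.
    """
    if plaque_count < 0:
        raise ValueError("plaque_count must be non-negative")
    if dilution_factor <= 0 or inoculum_ml <= 0:
        raise ValueError("dilution_factor and inoculum_ml must be positive")
    # Plaques counted in the well, scaled back to the undiluted stock per ml.
    return plaque_count * dilution_factor / inoculum_ml


if __name__ == "__main__":
    # Hypothetical well: 42 plaques from 0.1 ml of a 10^-6 dilution.
    titer = plaque_titer(42, 1e6, 0.1)
    print(f"{titer:.2e} pfu/ml")
```

In this hypothetical case, 42 plaques obtained from 0.1 ml of a 10^6-fold dilution
correspond to a stock titer of about 4.2 x 10^8 pfu/ml.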

As regards retroviruses, the compositions according to the invention may directly
contain the producing cells, with a view to their implantation.

In this regard, another subject of the invention relates to any mammalian cell
infected with one or more defective recombinant viruses according to the invention. More
particularly, the invention relates to any population of human cells infected with such
viruses. These may be in particular cells of blood origin (totipotent stem cells or
precursors), fibroblasts, myoblasts, hepatocytes, keratinocytes, smooth muscle and
endothelial cells, glial cells and the like.




The cells according to the invention may be derived from primary cultures. These
may be collected by any technique known to persons skilled in the art and then cultured
under conditions allowing their proliferation. As regards more particularly fibroblasts,
these may be easily obtained from biopsies, for example according to the technique
described by Ham (1980). These cells may be used directly for infection with the viruses,
or stored, for example by freezing, for the establishment of autologous libraries, in view of
a subsequent use. The cells according to the invention may be secondary cultures, obtained
for example from pre-established libraries (see for example EP 228458, EP 289034, EP
400047, EP 456640).

The cells in culture are then infected with a recombinant virus according to the
invention, in order to confer on them the capacity to produce at least one biologically active
ABCA5, ABCA6, ABCA9, or ABCA10 protein. The infection is carried out in vitro
according to techniques known to persons skilled in the art. In particular, depending on the
type of cells used and the desired number of copies of virus per cell, persons skilled in the
art can adjust the multiplicity of infection and optionally the number of infectious cycles
produced. It is clearly understood that these steps must be carried out under appropriate
conditions of sterility when the cells are intended for administration in vivo. The doses of
recombinant virus used for the infection of the cells may be adjusted by persons skilled in
the art according to the desired aim. The conditions described above for the administration
in vivo may be applied to the infection in vitro. For the infection with a retrovirus, it is also
possible to co-culture a cell to be infected with a cell producing the recombinant retrovirus
according to the invention. This makes it possible to eliminate purification of the
retrovirus.
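
By way of illustration only, the adjustment of the multiplicity of infection (MOI)
mentioned above may be sketched as follows. The function names and numerical values are
hypothetical. The sketch assumes that the MOI is expressed in pfu per cell and, for the
second function, that infectious particles are distributed among the cells according to
Poisson statistics; neither assumption is stated in the present description.

```python
import math


def virus_volume_ml(cell_count: float, moi: float, titer_pfu_per_ml: float) -> float:
    """Volume of viral suspension (ml) needed to reach the requested MOI.

    MOI is taken here as pfu per cell, so the required pfu is cell_count * moi.
    """
    if cell_count <= 0 or titer_pfu_per_ml <= 0:
        raise ValueError("cell_count and titer_pfu_per_ml must be positive")
    if moi < 0:
        raise ValueError("moi must be non-negative")
    return cell_count * moi / titer_pfu_per_ml


def expected_infected_fraction(moi: float) -> float:
    """Fraction of cells receiving at least one infectious particle.

    Assumes a Poisson distribution of particles per cell (an illustrative model).
    """
    if moi < 0:
        raise ValueError("moi must be non-negative")
    return 1.0 - math.exp(-moi)


if __name__ == "__main__":
    # Hypothetical culture: 10^6 cells, target MOI of 10, stock at 10^9 pfu/ml.
    print(virus_volume_ml(1e6, 10, 1e9), "ml of viral suspension")
    print(expected_infected_fraction(1.0), "of cells infected at MOI 1")
```

Under these assumptions, a high-titer suspension allows the desired MOI to be reached
with a small volume, which is consistent with the advantage noted for adenoviral vectors.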

Another subject of the invention relates to an implant comprising mammalian
cells infected with one or more defective recombinant viruses according to the invention or
cells producing recombinant viruses, and an extracellular matrix. Preferably, the implants
according to the invention comprise 10^ to 10^10 cells. More preferably, they comprise 10^
to 10^8 cells.

More particularly, in the implants of the invention, the extracellular matrix
comprises a gelling compound and optionally a support allowing the anchorage of the cells.

For the preparation of the implants according to the invention, various types of
gelling agents may be used. The gelling agents are used for the inclusion of the cells in a




matrix having the constitution of a gel, and for promoting the anchorage of the cells on the
support, where appropriate. Various cell adhesion agents can therefore be used as gelling
agents, such as in particular collagen, gelatin, glycosaminoglycans, fibronectin, lectins and
the like. Preferably, collagen is used in the context of the present invention. This may be
collagen of human, bovine or murine origin. More preferably, type I collagen is used.

As indicated above, the compositions according to the invention preferably
comprise a support allowing the anchorage of the cells. The term anchorage designates any
form of biological and/or chemical and/or physical interaction causing the adhesion and/or
the attachment of the cells to the support. Moreover, the cells may either cover the support
used, or penetrate inside this support, or both. It is preferable to use in the context of the
invention a solid, nontoxic and/or biocompatible support. In particular, it is possible to use
polytetrafluoroethylene (PTFE) fibers or a support of biological origin.

The present invention thus offers a very effective means for the treatment or 
prevention of pathologies linked to the transport of lipophilic substances. 

In addition, this treatment may be applied to both humans and any animals such as
ovines, bovines, domestic animals (dogs, cats and the like), horses, fish and the like.

RECOMBINANT HOST CELLS 

The invention relates to a recombinant host cell comprising a nucleic acid of the
invention, and more particularly, a nucleic acid comprising a nucleotide sequence selected
from SEQ ID NO: 1-4 and 9-126, or a complementary nucleotide sequence thereof.

The invention also relates to a recombinant host cell comprising a nucleic acid of
the invention, and more particularly a nucleic acid comprising a nucleotide sequence as
depicted in SEQ ID NO: 1-4 and 9-126, or a complementary nucleotide sequence thereof.

According to another aspect, the invention also relates to a recombinant host cell
comprising a recombinant vector according to the invention. Therefore, the invention also
relates to a recombinant host cell comprising a recombinant vector comprising any of the
nucleic acids of the invention, and more particularly a nucleic acid comprising a nucleotide
sequence selected from SEQ ID NO: 1-4 and 9-126, or a complementary nucleotide
sequence thereof.

The invention also relates to a recombinant host cell comprising a recombinant
vector comprising a nucleic acid comprising a nucleotide sequence as depicted in any one
of SEQ ID NOs: 1-4 and 9-126, or a complementary nucleotide sequence thereof.

The preferred host cells according to the invention are for example the following:

a) prokaryotic host cells: strains of Escherichia coli (strain DH5-α), of Bacillus
subtilis, of Salmonella typhimurium, or strains of genera such as Pseudomonas,
Streptomyces and Staphylococcus;

b) eukaryotic host cells: HeLa cells (ATCC No. CCL2), Cv 1 cells (ATCC No.
CCL70), COS cells (ATCC No. CRL 1650), Sf-9 cells (ATCC No. CRL 1711), CHO cells
(ATCC No. CCL-61) or 3T3 cells (ATCC No. CRL-6361).

METHODS FOR PRODUCING ABCA5, ABCA6, ABCA9, and ABCA10
POLYPEPTIDES

The invention also relates to a method for the production of a polypeptide 
comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, said method 
comprising the steps of: 

a) inserting a nucleic acid encoding said polypeptide into an appropriate vector;

b) culturing, in an appropriate culture medium, a previously transformed host cell or
transfecting a host cell with the recombinant vector of step a);

c) recovering the conditioned culture medium or lysing the host cell, for example by
sonication or by osmotic shock;

d) separating and purifying said polypeptide from said culture medium or
alternatively from the cell lysates obtained in step c); and

e) where appropriate, characterizing the recombinant polypeptide produced.

The polypeptides according to the invention may be characterized by binding to an
immunoaffinity chromatography column on which the antibodies directed against this
polypeptide or against a fragment or a variant thereof have been previously immobilized.

According to another aspect, a recombinant polypeptide according to the invention
may be purified by passing it over an appropriate series of chromatography columns,
according to methods known to persons skilled in the art and described for example in F.
Ausubel et al. (1989, Current Protocols in Molecular Biology, Green Publishing Associates and
Wiley Interscience, N.Y.).

A polypeptide according to the invention may also be prepared by conventional
chemical synthesis techniques either in homogeneous solution or in solid phase. By way of
illustration, a polypeptide according to the invention may be prepared either by the
technique in homogeneous solution described by Houben Weyl (1974, Methode der
Organischen Chemie, E. Wunsch Ed., 15-1:15-11) or by the solid phase synthesis technique
described by Merrifield (1965, Nature, 207(996):522-523; 1965, Science, 150(693):178-185).

A polypeptide termed "homologous" to a polypeptide having an amino acid
sequence selected from SEQ ID NO: 5-8 also forms part of the invention. Such a
homologous polypeptide comprises an amino acid sequence possessing one or more
substitutions of an amino acid by an equivalent amino acid of SEQ ID NO: 5-8.

An "equivalent amino acid" according to the present invention will be understood
to mean for example replacement of a residue in the L form by a residue in the D form or
the replacement of a glutamic acid (E) by a pyro-glutamic acid according to techniques
well known to persons skilled in the art. By way of illustration, the synthesis of a peptide
containing at least one residue in the D form is described by Koch (1977). According to
another aspect, two amino acids belonging to the same class, that is to say two uncharged
polar, nonpolar, basic or acidic amino acids, are also considered as equivalent amino acids.
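
By way of illustration only, the class-based notion of equivalent amino acids may be
sketched as follows. The assignment of each residue to a class is an assumption based on
a common textbook grouping and is not specified in the present description (the placement
of glycine, cysteine, tyrosine and histidine in particular varies between authors); all
function names are hypothetical. The sketch covers only same-class substitutions written
in the one-letter code and does not represent D-form residues or pyro-glutamic acid.

```python
# Assumed grouping of the 20 standard residues (one-letter code); not from the source.
AMINO_ACID_CLASSES = {
    "nonpolar": frozenset("AVLIPFWM"),
    "uncharged polar": frozenset("GSTCYNQ"),
    "basic": frozenset("KRH"),
    "acidic": frozenset("DE"),
}


def residue_class(residue: str) -> str:
    """Return the class name of a one-letter residue, or raise ValueError."""
    for name, members in AMINO_ACID_CLASSES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_equivalent_substitution(original: str, replacement: str) -> bool:
    """True when both residues belong to the same class."""
    return residue_class(original) == residue_class(replacement)


def is_homologous(reference: str, candidate: str) -> bool:
    """True when candidate differs from reference by one or more substitutions,
    each of which replaces a residue by a residue of the same class."""
    if len(reference) != len(candidate):
        return False
    changed = [(r, c) for r, c in zip(reference.upper(), candidate.upper()) if r != c]
    return bool(changed) and all(is_equivalent_substitution(r, c) for r, c in changed)


if __name__ == "__main__":
    # Hypothetical four-residue peptides, used only to exercise the functions.
    print(is_homologous("MKDE", "MRDE"))  # lysine replaced by arginine (both basic)
    print(is_homologous("MKDE", "MADE"))  # lysine replaced by alanine (different classes)
```

In this sketch an unchanged sequence is not reported as homologous, since the definition
above requires one or more substitutions.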

Polypeptides comprising at least one nonpeptide bond such as a retro-inverse bond
(NHCO), a carba bond (CH2CH2) or a ketomethylene bond (CO-CH2) also form part of the
invention.

Preferably, the polypeptides according to the invention comprising one or more
additions, deletions or substitutions of at least one amino acid will retain their capacity to be
recognized by antibodies directed against the nonmodified polypeptides.

ANTIBODIES 

The ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides according to the
invention, in particular 1) a polypeptide comprising an amino acid sequence of any one of
SEQ ID NOs: 5-8, 2) a polypeptide fragment or variant of a polypeptide comprising an
amino acid sequence of any one of SEQ ID NOs: 5-8, or 3) a polypeptide termed
"homologous" to a polypeptide comprising an amino acid sequence selected from SEQ ID
NOs: 5-8, may be used for the preparation of an antibody, in particular for detecting the
production of a normal or altered form of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides in a patient.

An antibody directed against a polypeptide termed "homologous" to a polypeptide
having an amino acid sequence selected from SEQ ID NO: 5-8 also forms part of the
invention. Such an antibody is directed against a homologous polypeptide comprising an
amino acid sequence possessing one or more substitutions of an amino acid by an
equivalent amino acid of SEQ ID NO: 5-8.

"Antibody" for the purposes of the present invention will be understood to mean in
particular polyclonal or monoclonal antibodies or fragments (for example the F(ab')2 and
Fab fragments) or any polypeptide comprising a domain of the initial antibody recognizing
the target polypeptide or polypeptide fragment according to the invention.

Monoclonal antibodies may be prepared from hybridomas according to the
technique described by Kohler and Milstein (1975, Nature, 256:495-497).

According to the invention, a polypeptide produced recombinantly or by chemical
synthesis, and fragments or other derivatives or analogs thereof, including fusion proteins,
may be used as an immunogen to generate antibodies that recognize a polypeptide
according to the invention. Such antibodies include but are not limited to polyclonal,
monoclonal, chimeric, single chain, Fab fragments, and an Fab expression library. The
anti-ABCA5, anti-ABCA6, anti-ABCA9, and anti-ABCA10 antibodies of the invention
may be cross reactive, e.g., they may recognize corresponding ABCA5, ABCA6, ABCA9,
and ABCA10 polypeptides from different species. Polyclonal antibodies have greater
likelihood of cross reactivity. Alternatively, an antibody of the invention may be specific
for a single form of any one of ABCA5, ABCA6, ABCA9, and ABCA10. Preferably, such
an antibody is specific for any one of human ABCA5, ABCA6, ABCA9, and ABCA10.

Various procedures known in the art may be used for the production of polyclonal
antibodies to any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides or
derivatives or analogs thereof. For the production of antibody, various host animals,
including but not limited to rabbits, mice, rats, sheep and goats, can be immunized by
injection with any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, or a
derivative (e.g., fragment or fusion protein) thereof. In one embodiment, any one of the
ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides or fragments thereof can be
conjugated to an immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole
limpet hemocyanin (KLH). Various adjuvants may be used to increase the immunological
response, depending on the host species, including but not limited to Freund's (complete
and incomplete), mineral gels such as aluminum hydroxide, surface active substances such
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet
hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille
Calmette-Guerin) and Corynebacterium parvum.

For preparation of monoclonal antibodies directed toward any one of ABCA5,
ABCA6, ABCA9, and ABCA10 polypeptides, or fragments, analogs, or derivatives
thereof, any technique that provides for the production of antibody molecules by
continuous cell lines in culture may be used. These include but are not limited to the
hybridoma technique originally developed by Kohler and Milstein (1975, Nature, 256:495-
497), as well as the trioma technique, the human B-cell hybridoma technique (Kozbor et al.,
1983, Immunology Today, 4:72; Cote et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-
2030), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole
et al., 1985, In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).
In an additional embodiment of the invention, monoclonal antibodies can be produced in
germ-free animals (WO 89/12690). In fact, according to the invention, techniques
developed for the production of "chimeric antibodies" (Morrison et al., 1984, J. Bacteriol.
159:870; Neuberger et al., 1984, Nature, 312:604-608; Takeda et al., 1985, Nature
314:452-454) by splicing the genes from a mouse antibody molecule specific for any one of
ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides together with genes from a human
antibody molecule of appropriate biological activity can be used; such antibodies are within
the scope of this invention. Such human or humanized chimeric antibodies are preferred
for use in therapy of human diseases or disorders (described infra), since the human or
humanized antibodies are much less likely than xenogenic antibodies to induce an immune
response, in particular an allergic response, themselves.

According to the invention, techniques described for the production of single chain
antibodies (U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 4,946,778)
can be adapted to produce ABCA5, ABCA6, ABCA9, and ABCA10 polypeptide-specific
single chain antibodies. An additional embodiment of the invention utilizes the techniques
described for the construction of Fab expression libraries (Huse et al., 1989, Science
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with
the desired specificity for any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, or its derivatives, or analogs.

Antibody fragments which contain the idiotype of the antibody molecule can be
generated by known techniques. For example, such fragments include but are not limited
to: the F(ab')2 fragment which can be produced by pepsin digestion of the antibody
molecule; the Fab' fragments which can be generated by reducing the disulfide bridges of
the F(ab')2 fragment, and the Fab fragments which can be generated by treating the
antibody molecule with papain and a reducing agent.

In the production of antibodies, screening for the desired antibody can be
accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme-
linked immunosorbent assay), "sandwich" immunoassays, immunoradiometric assays, gel
diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using
colloidal gold, enzyme or radioisotope labels, for example), western blots, precipitation
reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays),
complement fixation assays, immunofluorescence assays, protein A assays, and
immunoelectrophoresis assays, etc. In one embodiment, antibody binding is detected by
detecting a label on the primary antibody. In another embodiment, the primary antibody is
detected by detecting binding of a secondary antibody or reagent to the primary antibody.
In a further embodiment, the secondary antibody is labelled. Many means are known in the
art for detecting binding in an immunoassay and are within the scope of the present
invention. For example, to select antibodies which recognize a specific epitope of any one
of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, one may assay generated
hybridomas for a product which binds to any one of ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptide fragments containing such epitope. For selection of an antibody
specific to any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides from a
particular species of animal, one can select on the basis of positive binding with any one of
ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides expressed by or isolated from cells
of that species of animal.

The foregoing antibodies can be used in methods known in the art relating to the
localization and activity of any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, e.g., for Western blotting, imaging ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides in situ, measuring levels thereof in appropriate physiological samples, etc.,
using any of the detection techniques mentioned above or known in the art.

In a specific embodiment, antibodies that agonize or antagonize the activity of any
one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides can be generated. Such
antibodies can be tested using the assays described infra for identifying ligands.

The present invention relates to an antibody directed against 1) a polypeptide
comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, 2) a polypeptide
fragment or variant of a polypeptide comprising an amino acid sequence of any one of
SEQ ID NOs: 5-8, or 3) a polypeptide termed "homologous" to a polypeptide comprising
an amino acid sequence selected from SEQ ID NO: 5-8, as produced by the trioma
technique or the hybridoma technique described by Kozbor et al.
(1983, Hybridoma, 2(1):7-16).

The invention also relates to single-chain Fv antibody fragments (ScFv) as
described in US patent No. 4,946,778 or by Martineau et al. (1998, J Mol Biol, 280(1):117-
127).

The antibodies according to the invention also comprise antibody fragments
obtained with the aid of phage libraries as described by Ridder et al. (1995, Biotechnology
(NY), 13(3):255-260) or humanized antibodies as described by Reinmann et al. (1997,
AIDS Res Hum Retroviruses, 13(11):933-943) and Leger et al. (1997, Hum Antibodies, 8(1):3-
16).

The antibody preparations according to the invention are useful in immunological
detection tests intended for the identification of the presence and/or of the quantity of
antigen present in a sample.

An antibody according to the invention may comprise, in addition, a detectable
marker which is isotopic or nonisotopic, for example fluorescent, or may be coupled to a
molecule such as biotin, according to techniques well known to persons skilled in the art.

Thus, the subject of the invention is, in addition, a method of detecting the presence
of a polypeptide according to the invention in a sample, said method comprising the steps
of:

a) bringing the sample to be tested into contact with an antibody directed against 1)
a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, 2) a
polypeptide fragment or variant of a polypeptide comprising an amino acid sequence of any
one of SEQ ID NOs: 5-8, or 3) a polypeptide termed "homologous" to a polypeptide
comprising an amino acid sequence selected from SEQ ID NOs: 5-8, and

b) detecting the antigen/antibody complex formed.

The invention also relates to a box or kit for diagnosis or for detecting the presence
of a polypeptide in accordance with the invention in a sample, said box comprising:

a) an antibody directed against 1) a polypeptide comprising an amino acid sequence
of any one of SEQ ID NOs: 5-8, 2) a polypeptide fragment or variant of a polypeptide
comprising an amino acid sequence of any one of SEQ ID NOs: 5-8, or 3) a polypeptide
termed "homologous" to a polypeptide comprising an amino acid sequence selected from SEQ
ID NOs: 5-8, and

b) a reagent allowing the detection of the antigen/antibody complexes formed.

PHARMACEUTICAL COMPOSITIONS AND THERAPEUTIC METHODS OF
TREATMENT

The invention also relates to pharmaceutical compositions intended for the
prevention and/or treatment of a deficiency in the transport of cholesterol or inflammatory
lipid substances, characterized in that they comprise a therapeutically effective quantity of
a polynucleotide capable of giving rise to the production of an effective quantity of at least
one of ABCA5, ABCA6, ABCA9, and ABCA10 functional polypeptides, in particular a
polypeptide comprising an amino acid sequence of SEQ ID NOs: 5-8.

The invention also provides pharmaceutical compositions comprising a nucleic acid
encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides according to
the invention and pharmaceutical compositions comprising any one of ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides according to the invention intended for the prevention
and/or treatment of diseases linked to a deficiency in the transport of cholesterol or
inflammatory lipid substances.

The present invention also relates to a new therapeutic approach for the treatment of
pathologies linked to the transport of lipophilic substances, comprising transferring and
expressing in vivo nucleic acids encoding at least one of ABCA5, ABCA6, ABCA9, and
ABCA10 proteins according to the invention.

Thus, the present invention offers a new approach for the treatment and/or the
prevention of pathologies linked to the abnormalities of the transport of lipophilic
substances.

Consequently, the invention also relates to a pharmaceutical composition intended
for the prevention of, or treatment of subjects affected by, a dysfunction in the transport of
lipophilic substances, comprising a nucleic acid encoding at least one of the ABCA5, ABCA6,
ABCA9, and ABCA10 proteins, in combination with one or more physiologically
compatible vehicles and/or excipients.

According to a specific embodiment of the invention, a composition is provided for
the in vivo production of at least one of the ABCA5, ABCA6, ABCA9, and ABCA10
proteins. This composition comprises a nucleic acid encoding any one of the ABCA5,
ABCA6, ABCA9, and ABCA10 polypeptides placed under the control of appropriate
regulatory sequences, in solution in a physiologically acceptable vehicle and/or excipient.

Therefore, the present invention also relates to a composition comprising a nucleic
acid encoding a polypeptide comprising an amino acid sequence of SEQ ID NOs: 5-8,
wherein the nucleic acid is placed under the control of appropriate regulatory elements.

Preferably, such a composition will comprise a nucleic acid comprising a nucleotide 
sequence of SEQ ID NOs: 1-4, placed under the control of appropriate regulatory elements. 

According to another aspect, the subject of the invention is also a preventive and/or 
curative therapeutic method of treating diseases caused by a deficiency in the transport of 
lipophilic substances, such a method comprising a step of administering to a patient a 
nucleic acid encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides according to the invention, said nucleic acid being, where 
appropriate, combined with one or more physiologically compatible vehicles and/or 
excipients. 

The invention also relates to a pharmaceutical composition intended for the 
prevention or treatment of subjects affected by a dysfunction in the transport of 
lipophilic substances, comprising a recombinant vector according to the invention, in 
combination with one or more physiologically compatible excipients. 

According to a specific embodiment, a method of introducing a nucleic acid 
according to the invention into a host cell, in particular a host cell obtained from a 
mammal, in vivo, comprises a step during which a preparation, comprising a 
pharmaceutically compatible vector and a "naked" nucleic acid according to the invention, 
placed under the control of appropriate regulatory sequences, is introduced by local 
injection at the level of the chosen tissue, for example a smooth muscle tissue, the "naked" 
nucleic acid being absorbed by the cells of this tissue. 

The invention also relates to the use of a nucleic acid according to the invention, 
encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins, for the 
manufacture of a medicament intended for the prevention and/or treatment in various forms 
or more particularly for the treatment of subjects affected by a dysfunction in the transport 
of lipophilic substances. 

The invention also relates to the use of a recombinant vector according to the 
invention, comprising a nucleic acid encoding any one of the ABCA5, ABCA6, ABCA9, 
and ABCA10 proteins, for the manufacture of a medicament intended for the prevention 
and/or treatment of subjects affected by a dysfunction in the transport of lipophilic 
substances. 

As indicated above, the present invention also relates to the use of a defective 
recombinant virus according to the invention for the preparation of a pharmaceutical 
composition for the treatment and/or prevention of pathologies linked to the transport of 
lipophilic substances. 

The invention relates to the use of such a defective recombinant virus for the 
preparation of a pharmaceutical composition intended for the treatment and/or prevention 
of a deficiency associated with the transport of lipophilic substances. Thus, the present 
invention also relates to a pharmaceutical composition comprising one or more defective 
recombinant viruses according to the invention. 

The present invention also relates to the use of cells genetically modified ex vivo 
with a virus according to the invention, or of cells producing such viruses, implanted in 
the body, allowing a prolonged and effective expression in vivo of at least one of the 
biologically active ABCA5, ABCA6, ABCA9, and ABCA10 proteins. 

The present invention shows that it is possible to incorporate a nucleic acid 
encoding any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides into a viral 
vector, and that these vectors make it possible to effectively express a biologically active, 
mature form. More particularly, the invention shows that the in vivo expression of any one 
of the ABCA5, ABCA6, ABCA9, and ABCA10 genes may be obtained by direct 
administration of an adenovirus or by implantation of a producing cell or of a cell 
genetically modified by an adenovirus or by a retrovirus incorporating such a DNA. 

Preferably, the pharmaceutical compositions of the invention comprise a 
pharmaceutically acceptable vehicle or physiologically compatible excipient for an 
injectable formulation, in particular for an intravenous injection, such as for example into 
the patient's portal vein. These may relate in particular to isotonic sterile solutions or dry, 
in particular freeze-dried, compositions which, upon addition, depending on the case, of 
sterilized water or physiological saline, allow the preparation of injectable solutions. Direct 
injection into the patient's portal vein is preferred because it makes it possible to target the 
infection at the level of the liver and thus to concentrate the therapeutic effect at the level 
of this organ. 

A "pharmaceutically acceptable vehicle or excipient" includes diluents and fillers 
which are pharmaceutically acceptable for the method of administration, are sterile, and may 
be aqueous or oleaginous suspensions formulated using suitable dispersing or wetting 
agents and suspending agents. The particular pharmaceutically acceptable carrier and the 
ratio of active compound to carrier are determined by the solubility and chemical properties 
of the composition, the particular mode of administration, and standard pharmaceutical 
practice. 

Any nucleic acid, polypeptide, vector, or host cell of the invention will preferably 
be introduced in vivo in a pharmaceutically acceptable vehicle or excipient. The phrase 
"pharmaceutically acceptable" refers to molecular entities and compositions that are 
physiologically tolerable and do not typically produce an allergic or similar untoward 
reaction, such as gastric upset, dizziness and the like, when administered to a human. 
Preferably, as used herein, the term "pharmaceutically acceptable" means approved by a 
regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia 
or other generally recognized pharmacopeia for use in animals, and more particularly in 
humans. The term "excipient" refers to a diluent, adjuvant, excipient, or vehicle with 
which the compound is administered. Such pharmaceutical carriers can be sterile liquids, 
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, 
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water or aqueous 
saline solutions and aqueous dextrose and glycerol solutions are preferably 
employed as excipients, particularly for injectable solutions. Suitable pharmaceutical 
excipients are described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

The pharmaceutical compositions according to the invention may be equally well 
administered by the oral, rectal, parenteral, intravenous, subcutaneous or intradermal route. 

According to another aspect, the subject of the invention is also a preventive and/or 
curative therapeutic method of treating diseases caused by a deficiency in the transport of 
cholesterol or inflammatory lipid substances, comprising administering to a patient or 
subject a nucleic acid encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides, said nucleic acid being combined with one or more physiologically 
compatible vehicles and/or excipients. 

In another embodiment, the nucleic acids, recombinant vectors, and compositions 
according to the invention can be delivered in a vesicle, in particular a liposome (see 
Langer, 1990, Science, 249:1527-1533; Treat et al., 1989, Liposomes in the Therapy of 
Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss: New York, pp. 
353-365; and Lopez-Berestein, 1989, In: Liposomes in the Therapy of Infectious Disease and 
Cancer, Lopez-Berestein and Fidler (eds.), Liss: New York, pp. 317-327). 

In a further aspect, recombinant cells that have been transformed with a nucleic acid 
according to the invention and that express high levels of any one of ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides according to the invention can be transplanted in a 
subject in need of any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides. 
Preferably, autologous cells transformed with any one of the ABCA5, ABCA6, ABCA9, and 
ABCA10 encoding nucleic acids according to the invention are transplanted to avoid 
rejection; alternatively, technology is available to shield non-autologous cells that produce 
soluble factors within a polymer matrix that prevents immune recognition and rejection. 

A subject in whom administration of the nucleic acids, polypeptides, recombinant 
vectors, recombinant host cells, and compositions according to the invention is performed 
is preferably a human, but can be any animal. Thus, as can be readily appreciated by one of 
ordinary skill in the art, the methods and pharmaceutical compositions of the present 
invention are particularly suited to administration to any animal, particularly a mammal, 
and including, but by no means limited to, domestic animals, such as feline or canine 
subjects, farm animals, such as but not limited to bovine, equine, caprine, ovine, and 
porcine subjects, wild animals (whether in the wild or in a zoological garden), research 
animals, such as mice, rats, rabbits, goats, sheep, pigs, dogs, cats, etc., avian species, such 
as chickens, turkeys, songbirds, etc., for veterinary medical use. 

Preferably, a pharmaceutical composition comprising a nucleic acid, a recombinant 
vector, or a recombinant host cell, as defined above, will be administered to the patient or 
subject. 

METHODS OF SCREENING AN AGONIST OR ANTAGONIST COMPOUND 
FOR THE ABCA5, ABCA6, ABCA9, and ABCA10 POLYPEPTIDES 

According to another aspect, the invention also relates to various methods of 
screening compounds or small molecules for therapeutic use which are useful in the 
treatment of diseases due to a deficiency in the transport of cholesterol or inflammatory 
lipid substances. 

The invention therefore also relates to the use of any one of ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides, or of cells expressing any one of ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides, for screening active ingredients for the prevention 
and/or treatment of diseases resulting from ABCA5, ABCA6, ABCA9, or 
ABCA10 deficiencies. The catalytic sites and oligopeptide or immunogenic fragments of 
any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides can serve for screening 
product libraries by a whole range of existing techniques. The polypeptide fragment used in 
this type of screening may be free in solution, bound to a solid support, at the cell surface 
or in the cell. The formation of the binding complexes between any one of the ABCA5, 
ABCA6, ABCA9, and ABCA10 polypeptide fragments and the tested agent can then be 
measured. 

Another product screening technique which may be used in high-flux screenings 
giving access to products having affinity for the protein of interest is described in 
application WO84/03564. In this method, applied to any one of ABCA5, ABCA6, ABCA9, 
and ABCA10 proteins, various products are synthesized on a solid surface. These products 
react with corresponding ABCA5, ABCA6, ABCA9, and ABCA10 proteins or fragments 
thereof and the complex is washed. The products binding any one of the ABCA5, ABCA6, 
ABCA9, and ABCA10 proteins are then detected by methods known to persons skilled in 
the art. Non-neutralizing antibodies can also be used to capture a peptide and immobilize it 
on a support. 

Another possibility is to perform a product screening method using any one of the 
ABCA5, ABCA6, ABCA9, and ABCA10 neutralizing competition antibodies, at least one 
of ABCA5, ABCA6, ABCA9, and ABCA10 proteins and a product potentially binding any 
one of the ABCA5, ABCA6, ABCA9, and ABCA10 proteins. In this manner, the 
antibodies may be used to detect the presence of a peptide having a common antigenic unit 
with any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides or proteins. 

Of the products to be evaluated for their ability to increase activity of any one of 
ABCA5, ABCA6, ABCA9, and ABCA10, there may be mentioned in particular kinase-specific 
ATP homologs involved in the activation of the molecules, as well as 
phosphatases, which may be able to avoid the dephosphorylation resulting from said 
kinases. There may be mentioned in particular phosphodiesterase (PDE) inhibitors of the 
theophylline and 3-isobutyl-1-methylxanthine type or adenyl cyclase activators of the 
forskolin type. 

Accordingly, this invention relates to the use of any method of screening products, 
i.e., compounds, small molecules, and the like, based on the method of translocation of 
cholesterol or lipophilic substances between the membranes or vesicles, this being in all 
synthetic or cellular types, that is to say of mammals, insects, bacteria, or yeasts expressing 
constitutively or having incorporated any one of human ABCA5, ABCA6, ABCA9, and 
ABCA10 encoding nucleic acids. To this effect, labeled analogs of lipophilic substances may 
be used. 

Furthermore, knowing that the disruption of numerous transporters has been described 
(van den Hazel et al., 1999, J. Biol. Chem., 274: 1934-41), it is possible to think of using 
cellular mutants having a characteristic phenotype and to complement the function 
thereof with at least one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins and to use 
the whole for screening purposes. 

The invention also relates to a method of screening a compound or small molecule 
active on the transport of lipophilic substances, an agonist or antagonist of any one of 
ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, said method comprising the 
following steps: 

a) preparing a membrane vesicle comprising at least one of the ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides and a lipid substrate comprising a detectable marker; 

b) incubating the vesicle obtained in step a) with an agonist or antagonist candidate 
compound; 

c) qualitatively and/or quantitatively measuring release of the lipid substrate 
comprising a detectable marker; and 

d) comparing the release measurement obtained in step c) with a measurement of 
release of labeled lipophilic substrate by a vesicle that has not been previously incubated 
with the agonist or antagonist candidate compound. 
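
The comparison in the last step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the replicate values and the 20% decision threshold are hypothetical and are not specified by the method; the release values may be any marker readout (radioactivity or fluorescence) in consistent units.

```python
from statistics import mean

def classify_candidate(control_release, treated_release, threshold=0.2):
    """Compare marker release with and without the candidate compound.

    control_release: replicate readings from vesicles not incubated
        with the candidate compound.
    treated_release: replicate readings from vesicles incubated with it.
    threshold: hypothetical relative change required to call an effect.
    Returns (relative_change, label).
    """
    control = mean(control_release)
    treated = mean(treated_release)
    if control <= 0:
        raise ValueError("control release must be positive")
    relative_change = (treated - control) / control
    if relative_change >= threshold:
        label = "agonist candidate"
    elif relative_change <= -threshold:
        label = "antagonist candidate"
    else:
        label = "no clear effect"
    return relative_change, label

# Hypothetical replicate readings (e.g. counts per minute)
change, label = classify_candidate([1000, 1100, 900], [1500, 1600, 1400])
print(f"relative change {change:+.0%}: {label}")
```

In practice the threshold would be replaced by a statistical test on the replicates; the fixed cut-off is used here only to keep the sketch short.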

ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides comprise an amino acid 
sequence selected from SEQ ID NOs: 5-8. 

According to a first aspect of the above screening method, the membrane vesicle is 
a synthetic lipid vesicle, which may be prepared according to techniques well known to a 
person skilled in the art. According to this particular aspect, ABCA5, ABCA6, ABCA9, 
and ABCA10 proteins may be recombinant proteins. 

According to a second aspect, the membrane vesicle is a vesicle of a plasma 
membrane derived from cells expressing at least one of ABCA5, ABCA6, ABCA9, and 
ABCA10 polypeptides. These may be cells naturally expressing any one of ABCA5, 
ABCA6, ABCA9, and ABCA10 polypeptides or cells transfected with a nucleic acid 
encoding at least one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides or with a 
recombinant vector comprising a nucleic acid encoding any one of ABCA5, ABCA6, 
ABCA9, and ABCA10 polypeptides. 

According to a third aspect of the above screening method, the lipid substrate is 
chosen from prostaglandins or prostacyclins. 

According to a fourth aspect of the above screening method, the lipid substrate is 
chosen from cholesterol or phosphatidylcholine. 

According to a fifth aspect, the lipid substrate is radioactively labelled, for example 
with an isotope chosen from [illegible] or a radioisotope of iodine. 

According to a sixth aspect, the lipid substrate is labelled with a fluorescent 
compound, such as NBD or pyrene. 

According to a seventh aspect, the membrane vesicle comprising the labelled 
lipophilic substances and any one of the ABCA5, ABCA6, ABCA9, and ABCA10 
polypeptides is immobilized at the surface of a solid support prior to step b). 

According to an eighth aspect, the measurement of the fluorescence or of the 
radioactivity released by the vesicle is the direct reflection of the activity of lipid substrate 
transport by any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides. 

The invention also relates to a method of screening a compound or small molecule 
active on the transport of cholesterol or lipid substances, an agonist or antagonist of any 
one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, said method comprising the 
following steps: 

a) obtaining cells, for example a cell line, that, either naturally or after transfecting 
the cell with any one of ABCA5, ABCA6, ABCA9, and ABCA10 encoding nucleic 
acids, expresses any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides; 

b) incubating the cells of step a) in the presence of an anion labelled with a 
detectable marker; 

c) washing the cells of step b) in order to remove the excess of the labelled anion 
which has not penetrated into these cells; 

d) incubating the cells obtained in step c) with an agonist or antagonist candidate 
compound for any one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides; 

e) measuring efflux of the labelled anion; and 

f) comparing the value of efflux of the labelled anion determined in step e) with a 
value of the efflux of a labelled anion measured with cells that have not been previously 
incubated in the presence of the agonist or antagonist candidate compound of any one of 
ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides. 

In a first specific embodiment, any one of the ABCA5, ABCA6, ABCA9, and 
ABCA10 polypeptides comprises an amino acid sequence of SEQ ID NOs: 5-8. 

According to a second aspect, the cells used in the screening method described 
above may be cells not naturally expressing, or alternatively expressing at a low level, any 
one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, said cells being 
transfected with a recombinant vector according to the invention capable of directing the 
expression of a nucleic acid encoding any one of the ABCA5, ABCA6, ABCA9, and 
ABCA10 polypeptides. 

According to a third aspect, the cells may be cells having a natural deficiency in 
anion transport, or cells pretreated with one or more anion channel inhibitors such as 
Verapamil™ or tetraethylammonium. 

According to a fourth aspect of said screening method, the anion is a radioactively 
labelled iodide, such as a radiolabelled potassium iodide (KI) or sodium iodide (NaI) salt. 

According to a fifth aspect, the measurement of efflux of the labelled anion is 
determined periodically over time during the experiment, thus making it possible to also 
establish a kinetic measurement of this efflux. 

According to a sixth aspect, the value of efflux of the labelled anion is determined 
by measuring the quantity of labelled anion present at a given time in the cell culture 
supernatant. 

According to a seventh aspect, the value of efflux of the labelled anion is 
determined as the proportion of radioactivity found in the cell culture supernatant relative 
to the total radioactivity corresponding to the sum of the radioactivity found in the cell 
lysate and the radioactivity found in the cell culture supernatant. 
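
The proportion described in this aspect, supernatant radioactivity divided by the sum of supernatant and lysate radioactivity, can be written out as follows, together with the periodic (kinetic) reading of the fifth aspect. This is a minimal sketch: the function names and the sample counts are hypothetical, and counts per minute is only an assumed unit.

```python
def fractional_efflux(supernatant_cpm, lysate_cpm):
    """Efflux as the share of total radioactivity found in the supernatant."""
    total = supernatant_cpm + lysate_cpm
    if total <= 0:
        raise ValueError("total radioactivity must be positive")
    return supernatant_cpm / total

def efflux_time_course(samples):
    """Return (minutes, fractional efflux) pairs ordered by time.

    samples: iterable of (minutes, supernatant_cpm, lysate_cpm) tuples,
    one per sampling time point.
    """
    return [(t, fractional_efflux(s, l)) for t, s, l in sorted(samples)]

# Hypothetical readings taken at three time points
readings = [(10, 400, 600), (5, 200, 800), (20, 550, 450)]
for minutes, efflux in efflux_time_course(readings):
    print(f"{minutes:>3} min: {efflux:.2f}")
```

Normalising to the total radioactivity makes wells with different cell numbers or loading efficiencies comparable, which is presumably why the ratio is preferred to the raw supernatant count.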

The subject of the invention is also a method of screening a compound or small 
molecule active on the metabolism of lipophilic substances, an agonist or antagonist of any 
one of ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides, said method comprising 
the following steps: 

a) culturing cells of a human myocyte line in an appropriate culture medium, in the 
presence of purified human albumin; 

b) incubating the cells of step a) simultaneously in the presence of a compound 
stimulating the production of interleukin and of an agonist or antagonist candidate 
compound; 

c) incubating the cells obtained in step b) in the presence of an appropriate 
concentration of ATP; 

d) measuring interleukin released into the cell culture supernatant; and 

e) comparing the value of the release of the interleukin obtained in step d) with the 
value of the interleukin released into the culture supernatant of cells which have not been 
previously incubated in the presence of the agonist or antagonist candidate compound. 

According to a first aspect of the screening method described above, the cells used 
are human or mouse myocytes. 

According to a second aspect of the screening method, the compound stimulating 
the production of interleukin is a lipopolysaccharide. 

According to a third aspect of said method, the production of all interleukins and 
TNF alpha by these cells is also qualitatively and/or quantitatively determined. 

According to a fourth aspect, the level of expression of the messenger RNA 
encoding interleukin is also determined. 

The following examples are intended to further illustrate the present invention but 
do not limit the invention. 


EXAMPLES 

EXAMPLE 1 : Search for human ABCA5, ABCA6, ABCA9, and ABCA10 genes in 
genomic databases 

Expressed sequence tags (EST) of ABCA1-like genes as described by Allikmets et 
al. (Hum Mol Genet, 1996, 5, 1649-1655) were used to search GenBank and UniGene 
nucleotide sequence databases using BLAST2 (Altschul et al, 1997, Nucleic Acids Res., 
25:3389-3402). The main protein sequence databases screened were Swissprot, TrEMBL, 
Genpept and PIR. 

The genomic DNA analysis was performed by combination of several gene-finding 
programs such as GENSCAN (Burge and Karlin, 1997, J Mol Biol.; 268(1):78-94), 
FGENEH/FEXH (Solovyev and Salamov, 1997, ISMB; 5:294-302), and XPOUND 
(Thomas and Skolnick, 1994, J Math Appl Med Biol.; [illegible]). The combination of 
different tools leads to increased sensitivity and specificity. The second step in the 
genomic DNA analysis is the homology searching in the EST and protein databases. 
Combination of software performing database searching and software for exon/intron 
prediction gives the most sensitive and specific results. Sequence assembly and analysis 
were performed using the Genetics Computer Group (GCG) sequence analysis software 
package. 
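
One simple way to combine the output of several gene-finding programs is a majority vote over predicted exons. The sketch below is an assumption for illustration only, not the procedure actually used in this Example: the exon coordinates are invented, exons are matched only when start and end agree exactly, and the function name is hypothetical.

```python
from collections import Counter

def consensus_exons(predictions, min_support=2):
    """Keep exons predicted by at least `min_support` programs.

    predictions: dict mapping a program name to a set of (start, end)
    exon coordinates on the same genomic sequence.
    """
    votes = Counter(
        exon for exons in predictions.values() for exon in set(exons)
    )
    return sorted(exon for exon, n in votes.items() if n >= min_support)

# Invented coordinates, for illustration only
predictions = {
    "GENSCAN": {(100, 250), (400, 520), (900, 1010)},
    "FGENEH": {(100, 250), (400, 520)},
    "XPOUND": {(400, 520), (700, 760)},
}
print(consensus_exons(predictions))                  # supported by >= 2 programs
print(consensus_exons(predictions, min_support=1))   # union of all predictions
```

Lowering `min_support` favours sensitivity (more candidate exons are kept), while raising it favours specificity (only exons on which the programs agree survive).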

Multiple alignments were generated by GAP software from the GCG package and 
the Dialign2 program (Morgenstern et al, 1996, Proc Natl Acad Sci U S A.; 
93(22):12098-103), the FASTA3 package (Pearson and Lipman, 1988, Proc Natl Acad 
Sci U S A.; 85(8):2444-8) and SIM4 (Florea et al, 1998, Genome Res.; 
8(9):967-74). The specific ABCA motifs used in our process were the TMN, TMC, 
NBD1 and NBD2 described in the literature (Broccardo et al, 1999). This corresponds in 
ABCA1 to residues 630-846 for the N-terminal (TMN = exon 14-16) and 1647-1877 
for the C-terminal set of membrane spanners (TMC = exon 36-40). The NBD 
corresponds to the extended nucleotide binding domain, i.e., in ABCA1 it spans 
amino acids 885-1152 for the N-terminal one (NBD1 = exon 18-22) and 1918-2132 for 
the C-terminal one (NBD2 = exon 42-47). 
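
The ABCA1 motif boundaries quoted above can be held in a small lookup table, which makes it easy to check which motif a given residue falls in. The residue ranges are those given in the text; the dictionary and function names are hypothetical.

```python
# Residue ranges of the ABCA1 motifs, as quoted in the text (inclusive).
ABCA1_DOMAINS = {
    "TMN": (630, 846),     # N-terminal membrane spanners, exons 14-16
    "NBD1": (885, 1152),   # N-terminal nucleotide binding domain, exons 18-22
    "TMC": (1647, 1877),   # C-terminal membrane spanners, exons 36-40
    "NBD2": (1918, 2132),  # C-terminal nucleotide binding domain, exons 42-47
}

def domain_at(residue, domains=ABCA1_DOMAINS):
    """Return the motif containing `residue`, or None if it lies outside all."""
    for name, (start, end) in domains.items():
        if start <= residue <= end:
            return name
    return None

for position in (700, 860, 900, 2000):
    print(position, domain_at(position))
```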

Sequence comparison between candidate ABCA ESTs and two overlapping BAC 
clones containing the microsatellite marker D17S940 (GenBank accession # AC005495, 
AC005922) revealed surprisingly that all these ESTs are located within this 325,000 bp 
region. An electronic intron/exon prediction was performed by using the AC005495 and AC005922 
BAC sequences, and provided transcript sequences which were predicted to correspond to 
the full coding sequence (CDS) of ABCA6 and ABCA9. ABCA5 gene sequences were 
found to be partially contained in the contig of BACs as 3' and 5' ends, respectively. 
Moreover, the analysis of the sequence revealed the ABCA10 gene. 

Additional sequence information for ABCA5 was obtained by using the working 
draft BACs that overlap with the above described BAC contig (AC005495 and AC005922) 
on both 3' and 5' ends. A supplemental BAC working draft, e.g., GenBank Accession 
number AC007763, was then identified on the 5' end. A parallel database mining 
approach based on the specific motifs search in the different GenBank subdivisions and 
UniGene Homo sapiens led to identification of two of these sequences (one contains a 
TMC motif, one contains an NBD1 domain) which matched with two fragments of the BAC 
#AC007763. 



WO 02/46458 PCT/EP01/15401 


Using the exon-intron sequence of these genes, we compared the sequence of the 
cDNAs with the genomic sequence of the BACs (AC005495, AC005922 and AC007763) 
and established the approximate genomic size and respective orientation of the ABCA5, 
ABCA6, ABCA9, and ABCA10 genes. 


EXAMPLE 2 : 5' Extension of the human ABCA5, ABCA6, ABCA9, and ABCA10 
cDNA. 

This Example describes the isolation and identification of cDNA molecules 
encoding the full length human ABCA5, ABCA6, ABCA9, and ABCA10 proteins. 5' 
extension of the partial ABCA5, ABCA6, ABCA9, and ABCA10 cDNA sequences was 
performed by using a combination of 5' RACE and RT-PCR on liver, heart, or testis. 

Oligonucleotide primers that distinguish the novel ABCA5, ABCA6, ABCA9, and 
ABCA10 genes from other family members were designed taking advantage of the 
exonic/intronic prediction of the genomic sequence and used to identify specific cDNA 
transcripts by RT-PCR on RNA from various human tissues. With the exception of ABCA10, 
which required an additional cloning step, all RT-PCR products were directly sequenced. 
In the case of ABCA6, ABCA9, and ABCA10 a 5' RACE step was also performed in order to 
confirm the initiator ATG codon. The identification of the full CDS of ABCA5 was obtained by 
linking the 3 potential fragments of the transcript by RT-PCR and direct sequencing. 
Finally, full ORF sequences of these new genes that belong to the same chromosome 17 
cluster were determined. 

Reverse transcription 

In a total volume of 11.5 µl, 500 ng of poly(A)+ mRNA (Clontech) mixed with 
500 ng of oligo(dT) are denatured at 70°C for 10 min and then chilled on ice. After 
addition of 10 units of RNAsin, 10 mM DTT, 0.5 mM dNTP, Superscript first strand 
buffer and 200 units of Superscript II (Life Technologies), the reaction is incubated for 
45 min at 42°C. We used poly(A)+ mRNA from liver, heart, brain and lung for ABCA9, 
from testis for ABCA5, and from testis and heart for ABCA10. 


PCR 

Each polymerase chain reaction contained 400 µM each dNTP, 2 units of Thermus 
aquaticus (Taq) DNA polymerase (AmpliTaq Gold; Perkin Elmer), 0.5 µM each primer, 
2.5 mM MgCl2, PCR buffer and 50 ng of DNA, or about 25 ng of cDNA, or 1/50th of the 
primary PCR mixture. Reactions were carried out for 30 cycles in a Perkin Elmer 9700 
thermal cycler in 96-well microtiter plates. After an initial denaturation at 94°C for 10 min, 
each cycle consisted of: a denaturation step of 30 s (94°C), a hybridization step of 30 s 
(64°C for 2 cycles, 61°C for 2 cycles, 58°C for 2 cycles and 55°C for 28 cycles), and an 
elongation step of 1 min/kb (72°C). PCR ended with a final 72°C extension of 7 min. In 
the case of RT-PCR, control reactions without reverse transcriptase and reactions containing 
water instead of cDNA were performed for every sample. 
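
The stepped-down hybridization temperatures described above can be laid out as an explicit cycler program. The sketch below is illustrative only: the function names are hypothetical, the second hybridization temperature is assumed to be 61°C (it is poorly legible in the source), and the amplicon length is a free parameter. Note that the listed hybridization steps add up to 34 cycles, whereas the text states 30 cycles; the sketch simply follows the listed steps.

```python
# (hybridization temperature in °C, number of cycles); 61 is an assumed reading
ANNEALING_SCHEDULE = [(64, 2), (61, 2), (58, 2), (55, 28)]

def count_cycles(schedule=ANNEALING_SCHEDULE):
    """Total number of amplification cycles implied by the schedule."""
    return sum(n for _, n in schedule)

def build_program(amplicon_kb, schedule=ANNEALING_SCHEDULE):
    """Return the program as a list of (step, temperature °C, seconds)."""
    elongation_s = round(60 * amplicon_kb)  # 1 min per kb at 72 °C
    program = [("initial denaturation", 94, 600)]
    for anneal_temp, n_cycles in schedule:
        for _ in range(n_cycles):
            program.append(("denaturation", 94, 30))
            program.append(("hybridization", anneal_temp, 30))
            program.append(("elongation", 72, elongation_s))
    program.append(("final extension", 72, 420))
    return program

program = build_program(amplicon_kb=1.5)
print(count_cycles(), "cycles,", len(program), "steps")
print("total run time (min):", sum(seconds for _, _, seconds in program) / 60)
```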



DNA Sequencing 

PCR products were analyzed and quantified by agarose gel electrophoresis and purified 
with a P100 column. Purified PCR products were sequenced using the ABI Prism BigDye 
terminator cycle sequencing kit (Perkin Elmer Applied Biosystems). The sequence reaction 
mixture was purified using Microcon-100 microconcentrators (Amicon, Inc., Beverly). 
Sequencing reactions were resolved on an ABI 377 DNA sequencer (Perkin Elmer Applied 
Biosystems) according to the manufacturer's protocol (Applied Biosystems, Perkin Elmer). 



5' Rapid Amplification of cDNA Ends (RACE) 

5' RACE analysis was performed using the SMART RACE cDNA amplification 
kit (Clontech, Palo Alto, CA). Human testis, liver and heart polyA+ RNA (Clontech) 
were used as template to generate the 5' SMART cDNA library according to the 
manufacturer's instructions. First-amplification primers and nested primers were 
designed from the cDNA sequence. Amplimers of the nested PCR were cloned. Inserts of 
specific clones were amplified by PCR with universal primers (Rev and -21) and 
sequenced on both strands. Primers ABC-A6_L1, L2, ABC-A9_L1, L2, and ABC-A10_L1, 
L2 were used to identify 5' ends of ABC-A6, ABC-A9, and ABC-A10 
respectively. 

Primers 

Oligonucleotides were selected using Prime from the GCG package or Oligo 4 
(National Biosciences, Inc.) software. Primers were ordered from Life Technologies, 
Ltd and used without further purification (Table 15). 
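
For a quick sanity check of primers such as those listed in Table 15, basic properties can be computed directly from the sequences. The sketch below is not a substitute for the Prime or Oligo 4 programs named above: it uses the Wallace rule (2°C per A/T, 4°C per G/C) as a rough melting temperature estimate, which is a substituted approximation, and the function names are hypothetical. The two sequences are taken from Table 15.

```python
def gc_content(seq):
    """Fraction of G and C bases in the primer."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def wallace_tm(seq):
    """Rough melting temperature (°C): 2 per A/T plus 4 per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc

def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

primers = {
    "ABCA5_G": "GTTGTGAGGAGAAACCATC",  # forward primer, SEQ ID NO: 132
    "ABCA6_L": "TGCCCACACCAGTAAGCAG",  # reverse primer, SEQ ID NO: 165
}
for name, seq in primers.items():
    print(name, len(seq), f"GC={gc_content(seq):.0%}", f"Tm~{wallace_tm(seq)}C",
          reverse_complement(seq))
```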
Table 15: RT-PCR and 5' RACE primers 



SEQ 
ID 
NO: 



Name 



Sequence 



Orientatio 
n 



1 0^7 






X^v/1 WaiU. 






A TrirtTTTPTPPTP A P A A P 
VjAiuul 1 1 ^ 1 v^L^ 1 ^ A^ AA^ 








PAPPArJAP A AXnAl'inATrrA 
V^Av^V-/ AvJAl-/AA 1 UAvJVJA 1 vJ/\ 


r V/l W oTU 






nPTATATTPTTP A ATHriP A 




I j1 




PPT An A AriTAnAPPriPPTT* 
AvjAA^JI AVJAv^v-<OOv^1 1 


rorwoivj. 


132 


ABCA5_G 


GTTGTGAGGAGAAACCATC 


Forward 


133 


ABCA5JI 


CTGGATGGTTTCAGTCACA 


Reverse 




34D / /U_A 


pAriA A A AnppA ATPrinrrTn 

U AVjAAAAOV>L*AA i t^OVjfVJ 1 VJ 


PUrWoiVi 


1 


o45 / /U 


PP A PiflT AX ATnTTriTTT A A PP Afl 
V.^L/AVJOl Ai AlVJl IvJi 1 1 AA^LyAVJ 


XvCVClov 


1 l/C 


i4D / /U_U 


rtrmxp a n a tt a ptp^pptt a p 


POiWcUU. 




J45 / /\)_JJ 


riA APATXriA AnA APPA A PAP 
VJ AAi^ A i 1 VJ AAVJAAV-^V^ AAV^ AL. 


jvevcTsc 


1 JO 


343 / /U__r 


nXA AnnP AITTA ATPmAPPP 
VJ 1 AAVJvJVv Atjr 1 AA L\^l VJA^L/V^ 




1 "Xd 

1 


44Dloo_Jt> 


rjn A A APTrmAPAriA athp 

VJvjAAAL/ i VjVJAL//\vJ/\/\ i \J\^ 




1 /in 


44Dioo_Vv 


PT A PPPT A TTTP A P A TOPP 
AUL/V^l Ax i 1 U AV^A I VJV-^^ 


rOl WoTU 


1/11 
141 


AA^I QQ T\ 

445 1 oo__lJ 


MXTXPTPPP ATA ATA AP AriP 
vjI 1 H-fIoV-^I^AI AAl AAl^AVjv^ 




14Z 


44^1oo_xi 


riPTriTTATT ATnPPlAriA A AP 
IjL/lVJl lAl 1 A i Vj\jrVj/\Vj/\/\Av^ 


X' or WoTU. 


143 


y1/l<l QQ T 1 
44D1oo_1j1 


AriAPTAPAriTA AP A A A A nPPT A fiTriP A HPP 
ALjAO 1 AL^AVJ 1 AA^ AAAAVjL/^ 1 X\\J i VjV-^/\OV^^ 


j\.cvcrs>c 


1 AA 

144 


445 1 oo_JLZ 


ATPP A ATPPTATTAnTOTrJAPA A ArSnPTTr* 
Alv^UAAlL/L/l Al 1 AVJlVJlVJAUAAAVJVJL/i IVJ 


Reverse 


14j 


A"D/^A/^ A 


TP AnP A A APPA A AflPAPTTP 
X V^ Aijv^ AAAV^V^ AAAvJv^A*^ 1 X v-^ 


X'OIAVdrtl 




A T3/^ A "D 

Ar>CAo_Jt> 


TrJAPATPA A PTPPTPP A TP A P 
X LrAV-' A 1 AAv^ X L/V^ X L^L/ A X AL/ 


17 ^irAfc A 


1 ACS 

i4y 




L/CC 1 ur 1 vjA 1 VJvjAovjr Avj 1 ILr 


rorwoTQ. 


1 >l o 

148 


AtSi^ A £ #^1 

ABCA6_C1 


•TP A TT/1Pm/^/rS A T/"2n A T A TP. 

lUAl XUL^xvjVjvjAIvjviAIAIO 


rorwaru 


1 C>l 

154 


AT>0 A "r\ 
AoCAolJJ 


A A rmPiTP A ruir a a a a a tt a p a pp 

AALrtJVj X U ALjvjAAAAA X X AC AV^V^ 


JxCVCibC 


151 


ABCA6_JJ1 


A A T/TL/"^/^ A A T^TTPn A A P 

(jAAlUClVjAAlCi XuvjAVjAU 


jveverse 


153 


ABCA6JE 


IvjVjIvjIAAX 1 X X XCv^XvjACV^O 


rorwara 


152 


AJDC/Ao_lil 


A TTP A /Tl A TT A TP AAA PTPTl 
vjAX Xv^AOAX X AXv_/AAAV-.XVjVj 


rorwaTo. 


157 


ABCA6JF 


A OT^/^/^TTT A P A T/^ A A TPi^P 

COACl ICd 1 IAvjAIOAAIUCC 


Reverse 


155 


ABCA6_F1 


vjCjAA 1 1 C AUvjAUU 1 AU 1 vjvj 


iveverse 


1 CO 

158 


A nr^ A <C 

Al3CAo_(jr 


A A PTPP A A P A A PI A CXCW Af^S. A HCZ 
AAlJ 1 Lrvj AAC AAvjAOvj 1 AL/AAL/Ur 


X^OrWaTU 


156 


ABCA6_G1 


/~* A T^T'/^TT/'-'T/^TPTPP A A P A ^ A A PP 

CjAi ICjIUIvjII uUAAUAVjAAVjU 


rorwara 


160 


ABCA6_H 


r^r^nr^ a TnTP. A TP. A riT A A TP A A P 
Ciij VJVJ A 1 tj 1 vjA X iJ Avj X AA X vjAAIj 


Reverse


1 cr^ 

159 


ABCAoJil 


ATPPTA ATPPPA A A APTP APP 
A 1 Vjvj 1 AA X L'CU AAAAvj X AvjC 


17 r^fO ^ 


161 


A "D/^ A /C T 


/''^TTP A TT A PTP A TP A P A TC^C^C^C^ 
C X X L.AX X Av^ X L/AX V-/AwAX V>v>L/V«y 


Forward






O ATP A Ar AGGrTGGTACGG 


Reverse 


162   ABCA6_K            ACAACTTCCCCAGGAACCC              Forward
165   ABCA6_L            TGCCCACACCAGTAAGCAG              Reverse
164   ABCA6_M            CAAGAAAAATGCTAAGTCCCAG           Forward
166   ABCA6_N            GAAAATCAGTGGCACTCAATTC           Reverse
167   ABCA6_O            TGCCACTGATTTTCTAGTCTGC           Forward
169   ABCA6_P            CCTTTCAGTTCCACCTCTCC             Reverse
168   ABCA6_Q            CTGGGATCACAAAGCCAAC              Forward
171   ABCA6_R            AATACCTTTCCTGCCCTGC              Reverse
170   ABCA6_S            TCCACACTGAGATTCTGAAGC            Forward
172   ABCA6_T            GCCTGACTCTTTGGGTGAC              Reverse
147   ABCA6_L1           GTACATGAAAACTCACCATATCCATCCC     Reverse
146   ABCA6_I2           GCAAGTGCTGTTTTATTCATTATCTGCTG    Reverse
173   ABCA9_A            TGAGCGTGGGTCAGCAAAC              Forward
174   ABCA9_B            GCAACTCCTCCTTGGGCAAC             Reverse
175   ABCA9_C            TTTGTTGCCCAAGGAGGAG              Forward
176   ABCA9_D            GGAAAAACAAGGGAGAACATCG           Reverse
177   ABCA9J1            GCCCACTTGGATTCTTCAC              Forward
178   ABCA9J1            CCACACCTTTCAAAGCTTCTAC           Reverse
179   ABCA9_G1           ATGTGGTCCTTGAGAATGAAAC           Forward
180   ABCA9_G2           ACTGTGAAAGAAAACCTCAGGC           Forward
181   ABCA9_H1           CTTCATGTGGCAAAATCCC              Reverse
182   ABCA9_H2           TGTGCTGTCAATTTGGCATC             Reverse
183   ABCA9J             AAGAAGAAATGGGGCATAGG             Forward
184   ABCA9J             TGTATTTGGAGACAGTTCCCAC           Reverse
185   ABCA9_K1           AACAATCAGTGGCGTGGCX3             Forward
202   ABCA9_L1 (RACE)    CTTGGGTAGTTTTGGATTCAGGTGC        Reverse
186   ABCA9_L1 (CV)      GACATCCAGGAGGACAGGAAAG           Reverse
203   ABCA9_L2           AGATCCATTGAAGACATTTGAGGAGTG      Reverse
187   ABCA9_M            GCAGCCTCTTTCACTCCATAC            Forward
188   ABCA9_M1           CATTGTGTCAGGTGATGAAAAG           Forward
189   ABCA9_N            TTCATTTCTAGGCATCGCAG             Reverse
190   ABCA9_N1           CATTAGCAGGAGGATCAAAAAG           Reverse
191   ABCA9_O            TCrAGGGCTATTTTTTGGCAC            Forward
192   ABCA9_P            CGCrCCCTTTCAAAATCAC              Reverse
193   ABCA9_Q1           TGCGAGACTTTGATGAGACAC            Forward
194   ABCA9_T2           AGACCATCAGGGAGGAGAAC             Reverse
195   ABCA9_U            TGTGCCAGCAACCAAATC               Forward
196   ABCA9_U1           GCTGGAGATGAAGCTGAAGAAC           Forward
201   ABCA9_U2           AAGCATGATGTAGTAGTGACCC           Forward
197   ABCA9_V1           TTTCCACTTCACCGAGGG               Reverse
198   ABCA9_W            CCATGTTTTGTCTGTTGTGCC            Forward
199   ABCA9_X            CACCCATCAACCCATCATCTAC           Reverse
200   ABCA9_Z            AGGCACAACAGACAAAACATGG           Reverse
204   ABCA10_A           GATTGACATACATTTGCTTC             Forward
205   ABCA10_B           TACAGTGAAGAGAAATCCAG             Reverse
206   ABCA10_C           TGGAATTAGACATGCAAA               Forward
207   ABCA10_D           TGAAGAGGATAAGTCGGTC              Reverse
208   ABCA10_E           TATAATCGCTGATGCTGC               Reverse
212   ABCA10_I           AGATAAGCGTGCGTCAAC               Forward
215   ABCA10_^I          TCATCAACATTTCCCAGC               Reverse
216   ABCA10_AA          GAAATACTGGAGATGAGTCTG            Forward
217   ABCA10_^           GAGCTTAAGAGCTTCCACC              Reverse
213   ABCAIUC            TCTTATGGGAATTGTTAGCA             Forward
214   ABCAIUH            TTATGACTGGTTCCTCCTC              Reverse
209   ABCA10_L1          ACCAGGCCAGAGTCATTAAACTGATC       Reverse
210   ABCA10_L2          CCGAAAAGATGCACAAATATAGCCC        Reverse
211   ABCA10_U2          GTCAAAAGTTCATTCTAATTGTGGCC       Forward

EXAMPLE 3: Tissue distribution of the transcripts of the ABCA5, ABCA6, ABCA9,
and ABCA10 genes according to the invention.

The profile of expression of the polynucleotides according to the present invention
is determined according to the protocols for PCR-coupled reverse transcription and
Northern blot analysis described in particular by Sambrook et al. (1989, Molecular Cloning:
A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York).

For example, in the case of an analysis by reverse transcription, a pair of primers as
described above may be synthesized from a cDNA of the human ABCA5, ABCA6,
ABCA9, and ABCA10 genes. This primer pair may be used to detect the corresponding
ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs.

The polymerase chain reaction (PCR) is carried out on cDNA templates
corresponding to retrotranscribed polyA+ mRNAs (Clontech). The reverse transcription to
cDNA is carried out with the enzyme SUPERSCRIPT II (GibcoBRL, Life Technologies)
according to the conditions described by the manufacturer. The polymerase chain reaction
is carried out according to standard conditions, in 20 µl of reaction mixture with 25 ng of
cDNA preparation. The reaction mixture is composed of 400 µM of each of the dNTPs,
2 units of Thermus aquaticus (Taq) DNA polymerase (AmpliTaq Gold; Perkin Elmer),
0.5 µM of each primer, 2.5 mM MgCl2, and PCR buffer. Thirty-four PCR cycles
[denaturing for 30 seconds at 94°C, annealing for 30 seconds divided up as follows during
the 34 cycles: 64°C (2 cycles), 61°C (2 cycles), 58°C (2 cycles), and 55°C (28 cycles), and an
extension of one minute per kilobase at 72°C] are carried out after a first step of denaturing
at 94°C for 10 minutes using a Perkin Elmer 9700 thermocycler. The PCR reactions are
visualized on agarose gel by electrophoresis. The cDNA fragments obtained may be used
as probes for a Northern blot analysis and may also be used for the exact determination of
the nucleotide sequence.
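The stepped annealing programme described above can be written out cycle by cycle. The following is a minimal sketch, not part of the protocol itself: it reads the annealing temperatures as 64, 61, 58 and 55°C, and the class name, function names and the 1.5 kb amplicon size are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cycle:
    """One PCR cycle: temperatures in deg C, hold times in seconds."""
    denature_c: float
    anneal_c: float
    extend_c: float
    denature_s: int
    anneal_s: int
    extend_s: int


# Annealing plan from the protocol: (temperature in deg C, number of cycles).
ANNEALING_PLAN = [(64, 2), (61, 2), (58, 2), (55, 28)]


def build_schedule(amplicon_kb, plan=ANNEALING_PLAN):
    """Expand the annealing plan into one Cycle object per PCR cycle."""
    extend_s = round(60 * amplicon_kb)  # one minute of extension per kilobase
    return [
        Cycle(94, anneal_c, 72, 30, 30, extend_s)
        for anneal_c, n_cycles in plan
        for _ in range(n_cycles)
    ]


def total_hold_seconds(schedule, initial_denature_s=600):
    """Sum of programmed hold times; ramping between steps is ignored."""
    return initial_denature_s + sum(
        c.denature_s + c.anneal_s + c.extend_s for c in schedule
    )


# 1.5 kb is a hypothetical amplicon size chosen only for illustration.
schedule = build_schedule(amplicon_kb=1.5)
print(len(schedule), "cycles,", total_hold_seconds(schedule) / 60, "min of hold time")
```

Keeping the plan as data makes it easy to check that the four annealing stages add up to the stated 34 cycles before a run is programmed.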

Northern Blot Analysis 

To study mRNA expression of the ABCA5, ABCA6, ABCA9, and ABCA10
genes, human MTN (Multiple Tissue Northern) blots (Human II 7759-1, Human 7760-1,
and human fetal II 7756-1, Clontech) were hybridized with specific pools of probes
consisting of two amplimers: ABCA5_A-ABCA5_B and 445188_C-445188_P for
ABC-A5, ABCA6_A-ABCA6_B and ABCA6_S-ABCA6_T for ABC-A6, ABCA9_A-ABCA9_B
and ABCA9_M1-ABCA9_N1 for ABC-A9. A unique RT-PCR product
obtained between ABCA10_A-ABCA10_B was used for ABC-A10 (Table 15).

Preparation of the probe 

PCR products were gel-purified using a Qiaquick® column (Qiagen). 10-20 ng of
purified PCR product were radiolabelled with [α-32P]dCTP (Amersham; 6000 Ci/mmol,
10 mCi/ml) by the random priming method (Rediprime kit; Amersham) according to the
manufacturer's protocol. Unincorporated radioactive nucleotides were separated from
the labelled probe by filtration on a G50 microcolumn (Pharmacia). The probe was
competed with 50 µg of denatured human Cot1 DNA for 2 hours at 55°C.

Hybridization

Prehybridization of the Northern blot (6 hours at 42°C) with hybridization solution (5x
SSPE, 5x Denhardt's, 2.5% dextran, 0.5% SDS, 50% formamide, 100 ng/ml denatured
salmon sperm DNA, 40 µg denatured human DNA) was followed by hybridization with
radiolabelled probe (2.10^ cpm/ml hybridization solution) and 40 µg of denatured human
DNA. Filters were washed in 2x SSC for 30 min at room temperature, twice in 2x SSC-
0.1% SDS for 10 min at 65°C and twice in 1x SSC-0.1% SDS for 10 min at 65°C.
Northern blots were analyzed after overnight exposure on the Storm (Molecular Dynamics,
Sunnyvale, CA). The human transferrin probe was used to control the amount of RNA in
each lane of the membrane.

EXAMPLE 4 : Construction of the expression vector containing the complete cDNA
of ABCA5, ABCA6, ABCA9, or ABCA10 in mammalian cells

The ABCA5, ABCA6, ABCA9, or ABCA10 genes may be expressed in
mammalian cells. A typical eukaryotic expression vector contains a promoter which allows
the initiation of the transcription of the mRNA, a sequence encoding the protein, and the
signals required for the termination of the transcription and for the polyadenylation of the
transcript. It also contains additional signals such as enhancers, the Kozak sequence and
sequences necessary for the splicing of the mRNA. An effective transcription is obtained
with the early and late elements of the SV40 virus promoters, the retroviral LTRs or the
CMV virus early promoter. However, cellular elements such as the actin promoter may also
be used. Many expression vectors may be used to carry out the present invention; an
example of such a vector is pcDNA3 (Invitrogen).

EXAMPLE 5 : Production of normal and mutated ABCA5, ABCA6, ABCA9, or
ABCA10 polypeptides.

The normal ABCA5, ABCA6, ABCA9, or ABCA10 polypeptides encoded by the
complete corresponding cDNAs whose isolation is described in Example 2, or the mutated
ABCA5, ABCA6, ABCA9, or ABCA10 polypeptides whose complete cDNA may also be
obtained according to the techniques described in Example 2, may be easily produced in a
bacterial or insect cell expression system using the baculovirus vectors or in mammalian
cells with or without the vaccinia virus vectors. All the methods are now widely described
and are known to persons skilled in the art. A detailed description thereof will be found for
example in F. Ausubel et al. (1989, Current Protocols in Molecular Biology, Greene Publishing
Associates and Wiley Interscience, N.Y.).

EXAMPLE 6 : Production of an antibody directed against one of the mutated
ABCA5, ABCA6, ABCA9, or ABCA10 polypeptides.

The antibodies in the present invention may be prepared by various methods
(Current Protocols In Molecular Biology Volume 1 edited by Frederick M. Ausubel, Roger
Brent, Robert E. Kingston, David D. Moore, J.G. Seidman, John A. Smith, Kevin Struhl -
Massachusetts General Hospital Harvard Medical School, chapter 11, 1989). For example,
the cells expressing a polypeptide of the present invention are injected into an animal in
order to induce the production of serum containing the antibodies. In one of the methods
described, the proteins are prepared and purified so as to avoid contaminations. Such a
preparation is then introduced into the animal with the aim of producing polyclonal antisera
having a higher activity.

In the preferred method, the antibodies of the present invention are monoclonal
antibodies. Such monoclonal antibodies may be prepared using the hybridoma technique
(Kohler et al., 1975, Nature, 256:495; Kohler et al., 1976, Eur. J. Immunol., 6:292; Kohler et
al., 1976, Eur. J. Immunol., 6:511; Hammerling et al., 1981, Monoclonal Antibodies and
T-Cell Hybridomas, Elsevier, N.Y., pp. 563-681). In general, such methods involve
immunizing the animal (preferably a mouse) with a polypeptide or better still with a cell
expressing the polypeptide. These cells may be cultured in a suitable tissue culture
medium. However, it is preferable to culture the cells in an Eagle medium (modified Earle)
supplemented with 10% fetal bovine serum (inactivated at 56°C) and supplemented with
about 10 g/l of nonessential amino acids, 1000 U/ml of penicillin and about 100 ng/ml of
streptomycin.

The splenocytes of these mice are extracted and fused with a suitable myeloma cell
line. However, it is preferable to use the parental myeloma cell line (SP20) available from
the ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT
medium and then cloned by limiting dilution as described by Wands et al. (1981,
Gastroenterology, 80:225-232). The hybridoma cells obtained after such a selection are
tested in order to identify the clones secreting antibodies capable of binding to the
polypeptide.

Moreover, other antibodies capable of binding to the polypeptide may be produced
according to a 2-stage procedure using anti-idiotype antibodies. Such a method is based on
the fact that the antibodies are themselves antigens and consequently it is possible to obtain
an antibody recognizing another antibody. According to this method, the antibodies
specific for the protein are used to immunize an animal, preferably a mouse. The
splenocytes of this animal are then used to produce hybridoma cells, and the latter are
screened in order to identify the clones which produce an antibody whose capacity to bind
to the specific antibody-protein complex may be blocked by the polypeptide. These
antibodies may be used to immunize an animal in order to induce the formation of
antibodies specific for the protein in a large quantity.

It is preferable to use Fab and F(ab')2 and the other fragments of the antibodies of
the present invention according to the methods described here. Such fragments are typically
produced by proteolytic cleavage with the aid of enzymes such as papain (in order to
produce the Fab fragments) or pepsin (in order to produce the F(ab')2 fragments).
Otherwise, the secreted fragments recognizing the protein may be produced by applying
recombinant DNA or synthetic chemistry technology.

For the in vivo use of antibodies in humans, it would be preferable to use
"humanized" chimeric monoclonal antibodies. Such antibodies may be produced using
genetic constructs derived from hybridoma cells producing the monoclonal antibodies
described above. The methods for producing the chimeric antibodies are known to persons
skilled in the art (for a review, see: Morrison (1985, Science, 229:1202); Oi et al. (1986,
BioTechniques, 4:214); Cabilly et al., US patent No. 4,816,567; Taniguchi et al., EP
171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al.,



WO 02/46458 PCT/EP01/15401


WO 8702671; Boulianne et al. (1984, Nature, 312:643); and Neuberger et al. (1985,
Nature, 314:268)).

EXAMPLE 7 : Determination of polymorphisms/mutations in the ABCA5, ABCA6,
ABCA9, or ABCA10 genes.

The detection of polymorphisms or of mutations in the sequences of the transcripts
or in the genomic sequence of the ABCA5, ABCA6, ABCA9, or ABCA10 genes may be
carried out according to various protocols. The preferred method is direct sequencing.

For patients from whom it is possible to obtain an mRNA preparation, the preferred
method consists in preparing the cDNAs and sequencing them directly. For patients for
whom only DNA is available, and in the case of a transcript where the structure of the
corresponding gene is unknown or partially known, it is necessary to precisely determine
its intron-exon structure as well as the genomic sequence of the corresponding gene. This
therefore involves, in a first instance, isolating the genomic DNA BAC or cosmid clone(s)
corresponding to the transcript studied, sequencing the insert of the corresponding clone(s)
and determining the intron-exon structure by comparing the cDNA sequence to that of the
genomic DNA obtained.

The technique of detection of mutations by direct sequencing consists in comparing
the genomic sequences of the ABCA5, ABCA6, ABCA9, or ABCA10 gene obtained from
homozygotes for the disease or from at least 8 individuals (4 individuals affected by the
pathology studied and 4 individuals not affected) or from at least 32 unrelated individuals
from the studied population. The sequence divergences constitute polymorphisms. All
those modifying the amino acid sequence of the wild-type protein may be mutations
capable of affecting the function of said protein; these are preferably considered more
particularly for the study of cosegregation of the mutation and of the disease (denoted
genotype-phenotype correlation) in the pedigree, or of a pharmacological response to a
therapeutic molecule in the pharmacogenomic studies, or in the studies of case/control
association for the analysis of the sporadic cases.
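The comparison step can be illustrated with a short script that lists sequence divergences between a reference coding sequence and a sample, and flags those that modify the amino acid sequence. This is only a sketch under simplifying assumptions: the two sequences are already aligned, in frame and of equal length, the standard genetic code applies, and the 12-nucleotide sequences shown are invented toy data, not ABCA sequences.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_variants(reference, sample):
    """Compare two aligned, in-frame coding sequences base by base.

    Returns tuples of (1-based position, reference base, sample base,
    reference amino acid, sample amino acid, effect).
    """
    if len(reference) != len(sample) or len(reference) % 3 != 0:
        raise ValueError("sequences must be aligned, in frame and equal in length")
    variants = []
    for pos, (ref_base, alt_base) in enumerate(zip(reference, sample)):
        if ref_base == alt_base:
            continue
        start = pos - pos % 3  # first base of the codon containing this position
        ref_aa = CODON_TABLE[reference[start:start + 3]]
        alt_aa = CODON_TABLE[sample[start:start + 3]]
        effect = "synonymous" if ref_aa == alt_aa else "amino-acid change"
        variants.append((pos + 1, ref_base, alt_base, ref_aa, alt_aa, effect))
    return variants


# Toy sequences (hypothetical): reference encodes Met-Ala-Lys-Gly.
reference = "ATGGCCAAAGGT"
sample = "ATGGCTAAAGAT"
for variant in classify_variants(reference, sample):
    print(variant)
```

Only the divergences reported as amino-acid changes would be carried forward as candidate mutations in the sense of the paragraph above; synonymous divergences remain polymorphisms.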






EXAMPLE 8 : Identification of a causal gene for a disease linked to a deficiency in
the transport of cholesterol and inflammatory lipid substances by causal mutation or
a transcriptional difference

Among the mutations identified according to the method described in Example 7,
all those associated with the disease phenotype are capable of being causal. These results
are validated by sequencing the gene in all the affected individuals and their
relatives (whose DNA is available).

Moreover, Northern blot or RT-PCR analysis, according to the methods described
in Example 2, using RNA specific to affected or nonaffected individuals makes it possible
to detect notable variations in the level of expression of the gene studied, in particular
the absence of transcription of the gene.

EXAMPLE 9: Construction of recombinant vectors comprising a nucleic acid
encoding any one of ABCA5, ABCA6, ABCA9, and ABCA10 proteins

Synthesis of a nucleic acid encoding any one of human ABCA5, ABCA6, ABCA9, or
ABCA10 proteins:

Total RNA (500 ng) isolated from a human cell (for example, placental tissue,
Clontech, Palo Alto, CA, USA, or THP1 cells) may be used as source for the synthesis of
the cDNA of the human ABCA5, ABCA6, ABCA9, and ABCA10 genes. Methods to
reverse transcribe mRNA to cDNA are well known in the art. For example, one may use
the system "Superscript one step RT-PCR" (Life Technologies, Gaithersburg, MD, USA).

Oligonucleotide primers specific for ABCA5, ABCA6, ABCA9, and ABCA10
cDNAs may be used for this purpose, containing sequences as set forth in any of SEQ ID
NO: 127-217. These oligonucleotide primers may be synthesized by the phosphoramidite
method on a DNA synthesizer of the ABI 394 type (Applied Biosystems, Foster City, CA,
USA).

Sites recognized by the restriction enzyme NotI may be incorporated into the
amplified ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs to flank the cDNA region
desired for insertion into the recombinant vector by a second amplification step using 50 ng
of human ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs as template, and 0.25 µM of
the ABCA5, ABCA6, ABCA9, and ABCA10 specific oligonucleotide primers used above
containing, at their 5' end, the site recognized by the restriction enzyme NotI
(5'-GCGGCCGC-3'), in the presence of 200 µM of each of the deoxynucleotides dATP,
dCTP, dTTP and dGTP as well as the Pyrococcus furiosus DNA polymerase (Stratagene,
Inc., La Jolla, CA, USA).

The PCR reaction may be carried out over 30 cycles, each comprising a step of
denaturation at 95°C for one minute, a step of renaturation at 50°C for one minute and a
step of extension at 72°C for two minutes, in a thermocycler apparatus for PCR (Cetus
Perkin Elmer, Norwalk, CT, USA).
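The primer design used in this second amplification, a gene-specific primer carrying the NotI site at its 5' end, can be sketched as follows. The helper names are hypothetical, the two primer sequences are taken from the primer table purely as examples (nothing here asserts they were used as a pair for this step), and the insert is a placeholder.

```python
NOTI_SITE = "GCGGCCGC"  # recognition sequence quoted in the text

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def add_noti_tail(primer):
    """Prepend the NotI site to the 5' end of a gene-specific primer."""
    return NOTI_SITE + primer.upper()


def simulate_amplicon(forward_tailed, insert, reverse_tailed):
    """Top strand of the product: forward primer, insert, then the
    reverse complement of the reverse primer."""
    return forward_tailed + insert + reverse_complement(reverse_tailed)


forward = add_noti_tail("ACAACTTCCCCAGGAACCC")  # example forward primer
reverse = add_noti_tail("TGCCCACACCAGTAAGCAG")  # example reverse primer
amplicon = simulate_amplicon(forward, "ATGC" * 5, reverse)  # placeholder insert

# NotI is palindromic, so the tailed reverse primer also yields GCGGCCGC
# at the 3' end of the top strand.
print(amplicon.startswith(NOTI_SITE), amplicon.endswith(NOTI_SITE))
```

In practice a few extra bases are often added 5' of a terminal restriction site to help the enzyme cut near the fragment end; the text does not mention this, so it is left out of the sketch.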

Cloning of the cDNA of the human ABCA5, ABCA6, ABCA9, and ABCA10 genes into
an expression vector:

The human ABCA5, ABCA6, ABCA9, and ABCA10 cDNA inserts may then be
cloned into the NotI restriction site of an expression vector, for example, the pCMV vector
containing a cytomegalovirus (CMV) early promoter and an enhancer sequence as well as
the SV40 polyadenylation signal (Beg et al., 1990, PNAS, 87:3473; Applebaum-Boden,
1996, JCI 97), in order to produce the expression vectors designated pABCA5, pABCA6,
pABCA9, and pABCA10.

The sequence of the cloned cDNA can be confirmed by sequencing on the two
strands using the reaction set "ABI Prism Big Dye Terminator Cycle Sequencing ready"
(marketed by Applied Biosystems, Foster City, CA, USA) in a capillary sequencer of the
ABI 310 type (Applied Biosystems, Foster City, CA, USA).

Construction of a recombinant adenoviral vector containing the cDNA of the human
ABCA5, ABCA6, ABCA9, and ABCA10 genes:

Modification of the expression vector pCMV-β:

The β-galactosidase cDNA of the expression vector pCMV-β (Clontech, Palo Alto,
CA, USA, GenBank Accession No. U02451) may be deleted by digestion with the
restriction endonuclease NotI and replaced with a multiple cloning site containing, from the
5' end to the 3' end, the following sites: NotI, AscI, RsrII, AvrII, SwaI, and NotI, cloned at
the region of the NotI restriction site. The sequence of this multiple cloning site is:
5'-GGGGCGGGGCGGGGGGGGAGCGGCTAGGATTTAAATCGCGGCCCGCG-3'.

The DNA fragment between the EcoRI and SalI sites of the modified expression
vector pCMV may be isolated and cloned into the modified XbaI site of the shuttle vector
pXCXII (McKinnon et al., 1982, Gene, 19:33; McGrory et al., 1988, Virology, 163:614).

Modification of the shuttle vector pXCXII:

A multiple cloning site comprising, from the 5' end to the 3' end, the XbaI, EcoRI,
SfiI, PmeI, NheI, SrfI, PacI, SalI and XbaI restriction sites, having the sequence:
5'-CTCTAGAATTCGGCCTCCGTGGCCGTTTAAACGCTAGCGCCCGGGCl^
AAGTCGACTCTAGAGC-3', may be inserted at the level of the XbaI site (nucleotide at
position 3329) of the vector pXCXII (McKinnon et al., 1982, Gene, 19:33; McGrory et al.,
1988, Virology, 163:614).
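The order of sites in a multiple cloning site can be checked by scanning the sequence for each recognition sequence. The sketch below does this only for the portion of the sequence above that is legible up to the SrfI site, because the printed sequence is damaged between SrfI and SalI. The recognition sequences are the standard ones for these enzymes; the function names are illustrative.

```python
import re

# Recognition sequences (N = any base) for the first six sites listed in the text.
SITES = {
    "XbaI": "TCTAGA",
    "EcoRI": "GAATTC",
    "SfiI": "GGCCNNNNNGGCC",
    "PmeI": "GTTTAAAC",
    "NheI": "GCTAGC",
    "SrfI": "GCCCGGGC",
}

# Legible 5' portion of the multiple cloning site, up to and including SrfI.
MCS_PREFIX = "CTCTAGAATTCGGCCTCCGTGGCCGTTTAAACGCTAGCGCCCGGGC"


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for every site in seq."""
    hits = {}
    for name, pattern in sites.items():
        # A lookahead lets overlapping occurrences be reported as well.
        regex = re.compile("(?=" + pattern.replace("N", "[ACGT]") + ")")
        hits[name] = [m.start() for m in regex.finditer(seq)]
    return hits


def in_listed_order(hits, order):
    """True if each enzyme cuts exactly once and in the given 5'-to-3' order."""
    if any(len(hits[name]) != 1 for name in order):
        return False
    starts = [hits[name][0] for name in order]
    return starts == sorted(starts)


hits = find_sites(MCS_PREFIX)
print(hits)
print(in_listed_order(hits, list(SITES)))
```

The same scan can be repeated on a full, verified copy of the sequence to confirm the remaining PacI, SalI and XbaI sites.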

The EcoRI-SalI DNA fragment isolated from the modified vector pCMV-β
containing the CMV promoter/enhancer, the donor and acceptor splicing sites of SV40 and
the polyadenylation signal of SV40 may then be cloned into the EcoRI-SalI site of the
modified shuttle vector pXCX, designated pCMV-11.

Preparation of the shuttle vector pAD12-ABCA:

The human ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs are obtained by an
RT-PCR reaction, as described above, and cloned at the level of the NotI site into the
vector pCMV-12, yielding the vectors pCMV-ABCA5, pCMV-ABCA6,
pCMV-ABCA9, and pCMV-ABCA10.

Construction of the ABC1 recombinant adenovirus:

The recombinant adenovirus containing the human ABCA5, ABCA6, ABCA9, and
ABCA10 cDNAs may be constructed according to the technique described by McGrory et
al. (1988, Virology, 163:614).

Briefly, the vector pAD12-ABCA is cotransfected with the vector pJM17 according
to the technique of Chen and Okayama (1987, Mol. Cell. Biol., 7:2745-2752).
Likewise, the vector pAD12-Luciferase was constructed and cotransfected with the
vector pJM17.

The recombinant adenoviruses are identified by PCR amplification and subjected to
two purification cycles before a large-scale amplification in the human embryonic kidney
cell line HEK 293 (American Type Culture Collection, Rockville, MD, USA).
The infected cells are collected 48 to 72 hours after their infection with the
adenoviral vectors and subjected to five freeze-thaw lysing cycles.

The crude lysates are extracted with the aid of Freon (Halocarbone 113, Matheson
Product, Secaucus, NJ, USA), sedimented twice in cesium chloride supplemented with
0.2% murine albumin (Sigma Chemical Co., St Louis, MO, USA) and dialysed
extensively against buffer composed of 150 mM NaCl, 10 mM Hepes (pH 7.4), 5 mM KCl,
1 mM MgCl2, and 1 mM CaCl2.
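As a worked example of the buffer composition, the masses to weigh out can be computed from the molar concentrations. This sketch assumes that the NaCl concentration is 150 mM (in line with the other millimolar components) and that the anhydrous salts and HEPES free acid are used; hydrated salts would need their own molar masses. Adjusting the pH to 7.4 is not modelled.

```python
# Molar masses in g/mol (anhydrous salts, HEPES free acid).
MOLAR_MASS = {
    "NaCl": 58.44,
    "HEPES": 238.30,
    "KCl": 74.55,
    "MgCl2": 95.21,
    "CaCl2": 110.98,
}

# Dialysis buffer composition in mmol/l, with NaCl read as 150 mM.
BUFFER_MM = {"NaCl": 150, "HEPES": 10, "KCl": 5, "MgCl2": 1, "CaCl2": 1}


def grams_needed(volume_l, recipe_mm=BUFFER_MM):
    """Mass of each component (g) for the requested buffer volume in litres."""
    return {
        name: conc_mm / 1000 * MOLAR_MASS[name] * volume_l
        for name, conc_mm in recipe_mm.items()
    }


for name, grams in grams_needed(2.0).items():  # 2 litres is an arbitrary example
    print(f"{name}: {grams:.3f} g")
```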

The recombinant adenoviruses are stored at -70°C and titrated before their
administration to animals or their incubation with cells in culture.

The absence of wild-type contaminating adenovirus is confirmed by screening with
the aid of PCR amplification using oligonucleotide primers located in the structural portion
of the deleted region.

Validation of the expression of the human ABCA5, ABCA6, ABCA9, and ABCA10
cDNAs:

Polyclonal antibodies specific for human ABCA5, ABCA6, ABCA9, and
ABCA10 polypeptides may be prepared as described above in rabbits and chicks by
injecting a synthetic polypeptide fragment derived from an ABC1 protein, comprising all or
part of an amino acid sequence as described in SEQ ID NO: 5-8. These polyclonal
antibodies are used to detect and/or quantify the expression of the human ABCA5,
ABCA6, ABCA9, and ABCA10 genes in cells and animal models by immunoblotting
and/or immunodetection.

The biological activity of ABCA5-6, 9-10 may be monitored by quantifying the
cholesterol fluxes induced by apoA-I using cells transfected with the vector pCMV-ABC1
which have been loaded with cholesterol (Remaley et al., 1997, ATVB, 17:1813).

Expression in vitro of the human ABCA5, ABCA6, ABCA9, and ABCA10 cDNAs in
cells:

Cells of the HEK293 line and of the COS-7 line (American Type Culture
Collection, Bethesda, MD, USA), as well as fibroblasts in primary culture derived from
Tangier patients or from patients suffering from hypo-alphalipoproteinemia are transfected
with the expression vector pCMV-ABCA5, pCMV-ABCA6, pCMV-ABCA9, and
pCMV-ABCA10 (5-25 µg) using Lipofectamine (BRL, Gaithersburg, MD, USA) or by
coprecipitation with the aid of calcium chloride (Chen et al., 1987, Mol. Cell. Biol.,
7:2745-2752).

These cells may also be infected with the vector pABCA5-AdV, pABCA6-AdV,
pABCA9-AdV, and pABCA10-AdV (multiplicity of infection, MOI = 10).
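The amount of titrated virus stock needed to infect a culture at a given multiplicity of infection follows from simple arithmetic: plaque-forming units required = number of cells x MOI. A minimal sketch, in which the cell number and stock titer are hypothetical values chosen only to illustrate the calculation:

```python
def pfu_required(cell_count, moi=10.0):
    """Plaque-forming units needed to reach the given MOI."""
    return cell_count * moi


def stock_volume_ul(pfu, titer_pfu_per_ml):
    """Volume of titrated stock, in microlitres, containing `pfu` units."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return pfu / titer_pfu_per_ml * 1000.0


cells = 1e6   # hypothetical number of cells in the dish
titer = 1e10  # hypothetical stock titer in pfu/ml
needed = pfu_required(cells)  # MOI = 10, as stated in the text
print(f"{needed:.2e} pfu -> {stock_volume_ul(needed, titer):.2f} ul of stock")
```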

The expression of human ABCA5-6, 9-10 may be monitored by immunoblotting as
well as by quantification of the efflux of cholesterol induced by apoA-I using transfected
and/or infected cells.



Expression in vivo of the ABCA5, ABCA6, ABCA9, and ABCA10 genes in various animal
models:

An appropriate volume (100 to 300 µl) of a medium containing the purified
recombinant adenovirus (pABCA-AdV or pLucif-AdV) containing from 10* to 10^ lysis
plaque-forming units (pfu) is infused into the saphenous vein of mice (C57BL/6, both
control mice and models of transgenic or knock-out mice) on day 0 of the experiment.

The evaluation of the physiological role of the ABCA5, ABCA6, ABCA9, and
ABCA10 proteins in the transport of cholesterol or inflammatory lipid substances is carried
out by determining the total quantity of cholesterol or appropriate inflammatory lipid
substances before (day zero) and after (days 2, 4, 7, 10, 14) the administration of the
adenovirus.

Kinetic studies with the aid of radioactively labelled products are carried out on day
5 after the administration of the vectors rLucif-AdV and rABCA-AdV in order to evaluate
the effect of the expression of ABCA5, ABCA6, ABCA9, and ABCA10 on the transport of
cholesterol and inflammatory lipid substances.

Furthermore, transgenic mice and rabbits overexpressing the ABCA5, ABCA6,
ABCA9, and ABCA10 genes may be produced, in accordance with the teaching of
Vaisman (1995) and Hoeg (1996) using constructs containing the human ABCA5, ABCA6,
ABCA9, and ABCA10 cDNAs under the control of endogenous promoters such as
ABCA5, ABCA6, ABCA9, and ABCA10, or CMV or apoE.

The evaluation of the long-term effect of the expression of ABCA5, ABCA6,
ABCA9, and ABCA10 on the kinetics of the lipids involved in the mediation of the
inflammation may be carried out as described above.

EXAMPLE 10: Isotopic in situ hybridization study of the ABCA9 gene

In situ hybridization experiments were performed using a radiolabeled cRNA probe
of 330 bp corresponding to nucleotides 1 to 330 of the nucleotide sequence of SEQ ID NO: 3.
The 330 bp insert was subcloned and transcribed in vitro with T7 (antisense) and SP6
(sense) RNA polymerases in the presence of 35S-uridine 5'-triphosphate. After
transcription, the probes were column-purified and separated by electrophoresis on a 5%
polyacrylamide gel to confirm size and purity.

Serial artery and heart tissue sections were digested with Proteinase K and
hybridized with the probes at a concentration of approximately 3.4 x 10^ dpm/ml for 18
hours at 55°C. Following hybridization, the slides were treated with RNAse A and
washed stringently in 0.1X SSC at 60°C for 2 hours. The slides were then coated with
Kodak NTB-2 emulsion, exposed for 14 days at 4°C, and developed using Kodak D-19
Developer and Fixer. Slides were stained with hematoxylin and eosin (H&E) and
imaged using a DVC 1310C camera coupled to a Nikon microscope.

Two control probes were used in these studies. All tissues were screened initially
with a probe for beta-actin mRNA to ensure that RNAs were preserved within the
archival paraffin samples. Adjacent serial sections were also hybridized with a sense
control riboprobe that was derived from the same region of the gene as the antisense
probe. At these hybridization and wash stringencies, the sense control probe tended to
produce a background signal across all tissues that was not associated with particular
cell types and that appeared to be due to nonspecific sticking of the sense probe to the
tissues. The nonspecific background produced by the antisense probe was less than that
observed with the sense probe, and the positive antisense signals described in the
accompanying report were specifically cell-associated and higher than the background
signals present in nonreactive cell types.

Figure 5 shows a section of normal renal artery obtained at nephrectomy from an
80-year-old female with renal cell carcinoma, with faint hybridization of
medial smooth muscle in both arteries and veins. Adventitial nerves showed occasional
faint positivity in Schwann cells (Figure 6). Also, in an adjacent ganglion, ganglion cells
and Schwann cells were both faintly positive (Figure 7).

Figures 8 and 9 display sections of normal renal artery obtained at nephrectomy
from a 24-year-old male with congenital stenosis of the ureteropelvic junction, and show
hybridization in a collecting duct epithelium and a renal tubular epithelium,
respectively.

Figure 10 shows a section of a normal heart obtained from a 41-year-old female
who died of carcinoma of the cervix, with moderately positive hybridization of
the cardiac myocytes.

Figure 11 shows a section of a normal heart obtained from a 60-year-old male who
died of non-small cell lung carcinoma; the endothelium was occasionally
positive in interstitial vessels.

EXAMPLE 11: Isotopic in situ hybridization study of the ABCA10 gene

In situ hybridization experiments have been performed using serial arterial,
myocardial, and skeletal tissue sections from archival paraffin samples.

Tissue sections were hybridized with radiolabeled cRNA probes of 405 bp
corresponding to nucleotides 1383 to 1787 of the nucleotide sequence of SEQ ID NO: 4.
This region was PCR-amplified, and the PCR product was then transcribed in vitro with T7
(antisense) and SP6 (sense) RNA polymerases in the presence of 35S-uridine
5'-triphosphate. After transcription, the probes were column-purified and separated by
electrophoresis on a 5% polyacrylamide gel to confirm size and purity. Tissue sections
were digested, and in situ hybridization was performed as described in Example 10.
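The probe coordinates quoted in Examples 10 and 11 are 1-based and inclusive, so a region from nucleotide s to nucleotide e spans e - s + 1 bases: 330 for positions 1 to 330 and 405 for positions 1383 to 1787. The sketch below checks this arithmetic and shows how such a region, and the complementary strand that an antisense riboprobe follows, could be extracted. The helper names are hypothetical and the 12-base sequence is toy data, not SEQ ID NO: 3 or 4.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def region_length(start, end):
    """Length of a region given 1-based, inclusive coordinates."""
    if start < 1 or end < start:
        raise ValueError("coordinates must satisfy 1 <= start <= end")
    return end - start + 1


def extract_region(seq, start, end):
    """Return the region start..end (1-based, inclusive) of a sequence."""
    region_length(start, end)  # validates the coordinates
    if end > len(seq):
        raise ValueError("region extends beyond the sequence")
    return seq[start - 1:end]


def reverse_complement(seq):
    """DNA reverse complement; an antisense RNA probe has this sequence
    with U in place of T."""
    return seq.translate(_COMPLEMENT)[::-1]


print(region_length(1, 330), region_length(1383, 1787))

toy = "ATGGCCAAAGGT"  # toy sequence for illustration only
sense = extract_region(toy, 4, 9)
print(sense, reverse_complement(sense))
```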

Figures 12 and 13, which display in situ hybridization of arterial tissues, showed
that the strongest hybridization was identified consistently in macrophages, subsets of
lymphocytes, and in Schwann cells of nerves.

Figure 14, which displays a myocardial tissue section, showed positive signals of
macrophages in the atheroma. In an adjacent section of tissue containing a ganglion,
subsets of Schwann cells were moderately positive, and ganglion cells showed faint
hybridization (Figure 15).

Figure 16 displays a section of skeletal tissue, wherein scattered macrophages were
faintly to moderately positive. Figure 17 showed moderate hybridization of Schwann
cells in a nerve.

The present invention is not to be limited in scope by the specific embodiments
described herein. Indeed, various modifications of the invention in addition to those
described herein will become apparent to those skilled in the art from the foregoing
description and the accompanying figures. Such modifications are intended to fall within
the scope of the appended claims.

1. An isolated nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126,
or of a complementary nucleotide sequence.
5 2. An isolated nucleic acid comprising at least eight consecutive nucleotides of 

a nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary 
nucleotide sequence. 

3. An isolated nucleic acid comprising at least 80% nucleotide identity with a
nucleic acid comprising any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary
nucleotide sequence.

4. The isolated nucleic acid according to claim 3, wherein the nucleic acid 
comprises an 85%, 90%, 95%, or 98% nucleotide identity with the nucleic acid comprising 
any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence. 

5. An isolated nucleic acid that hybridizes under high stringency conditions
with a nucleic acid comprising a) any one of SEQ ID NOs: 1-4 and 9-126, or of a
complementary nucleotide sequence.

6. An isolated nucleic acid comprising a nucleotide sequence as depicted in 
any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary nucleotide sequence. 

7. A nucleotide probe or primer specific for any one of ABCA5, ABCA6,
ABCA9, and ABCA10 genes, wherein the nucleotide probe or primer comprises at least 15
consecutive nucleotides of a nucleotide sequence of any one of SEQ ID NOs: 1-4 and
9-126, or of a complementary nucleotide sequence.

8. A nucleotide probe or primer specific for an ABCA5 gene, wherein the
nucleotide probe or primer comprises a nucleotide sequence of any one of SEQ ID
NO: 127-144, or a complementary nucleotide sequence.

9. A nucleotide probe or primer specific for an ABCA6 gene, wherein the
nucleotide probe or primer comprises a nucleotide sequence of any one of SEQ ID NOs:
145-172, or of a complementary nucleotide sequence.

10. A nucleotide probe or primer specific for an ABCA9 gene, wherein the
nucleotide probe or primer comprises a nucleotide sequence of any one of SEQ ID NOs:
173-203, or of a complementary nucleotide sequence.

11. A nucleotide probe or primer specific for an ABCA10 gene, wherein the
nucleotide probe or primer comprises a nucleotide sequence of any one of SEQ ID NOs: 
204-217, or of a complementary nucleotide sequence. 

12. A method of amplifying a region of the nucleic acid according to claim 1,
wherein the method comprises:

a) contacting the nucleic acid with two nucleotide primers, wherein the first
nucleotide primer hybridizes at a position 5' of the region of the nucleic acid, and the
second nucleotide primer hybridizes at a position 3' of the region of the nucleic acid, in the
presence of reagents necessary for an amplification reaction; and

b) detecting the amplified nucleic acid region.

13. A method of amplifying a region of the nucleic acid according to claim 12,
wherein the two nucleotide primers are selected from the group consisting of

a) a nucleotide primer comprising at least 15 consecutive nucleotides of a
nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary
nucleotide sequence,

b) a nucleotide primer as in any one of claims 7-11,

c) a nucleotide primer comprising a nucleotide sequence of any one of SEQ ID
NOs: 127-217, or a nucleic acid having a complementary sequence.

14. A kit for amplifying the nucleic acid according to claim 1, wherein the kit
comprises:

a) two nucleotide primers whose hybridization position is located respectively 5'
and 3' of the region of the nucleic acid; and optionally,

b) reagents necessary for an amplification reaction. 

15. The kit according to claim 14, wherein the two nucleotide primers are
selected from the group consisting of

a) a nucleotide primer comprising at least 15 consecutive nucleotides of a
nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary
nucleotide sequence,

b) a nucleotide primer as in any one of claims 7-11,

c) a nucleotide primer comprising a nucleotide sequence of any one of SEQ ID
NOs: 127-217, or a nucleic acid having a complementary sequence.

16. The nucleotide probe or primer according to any of claims 7-11, wherein the
nucleotide probe or primer comprises a marker compound.

17. A method of detecting a nucleic acid according to claim 1, wherein the
method comprises:

a) contacting the nucleic acid with a nucleotide probe selected from the group
consisting of

1) a nucleotide probe comprising at least 15 consecutive nucleotides of a
nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary
nucleotide sequence,

2) a nucleotide primer as in any one of claims 7-11,

3) a nucleotide probe comprising a nucleotide sequence of any one of SEQ
ID NOs: 127-217, or of a complementary nucleotide sequence, and

b) detecting a complex formed between the nucleic acid and the probe.

18. The method of detection according to claim 17, wherein the probe is 
immobilized on a support. 

19. A kit for detecting the nucleic acid according to claim 1, wherein the kit
comprises

a) a nucleotide probe selected from the group consisting of

1) a nucleotide probe comprising at least 15 consecutive nucleotides of a
nucleotide sequence of any one of SEQ ID NOs: 1-4 and 9-126, or of a complementary
nucleotide sequence,

2) a nucleotide primer as in any one of claims 7-11,

3) a nucleotide probe comprising a nucleotide sequence of any one of SEQ
ID NOs: 127-217, or of a complementary nucleotide sequence, and optionally,

b) reagents necessary for a hybridization reaction.

20. The kit according to claim 19, wherein the probe is immobilized on a
support.

21. A recombinant vector comprising the nucleic acid according to claim 1.

22. The vector according to claim 21, wherein the vector is an adenovirus.

23. A recombinant host cell comprising the recombinant vector according to
claim 21.

24. A recombinant host cell comprising the nucleic acid according to claim 1.

25. An isolated nucleic acid encoding a polypeptide comprising an amino acid 
sequence of any one of SEQ ID NO: 5-8. 

26. A recombinant vector comprising the nucleic acid according to claim 25. 

27. A recombinant host cell comprising the nucleic acid according to claim 25.

28. A recombinant host cell comprising the recombinant vector according to 
claim 26. 

29. An isolated polypeptide selected from the group consisting of

a) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 5-8,

b) a polypeptide fragment or variant of a polypeptide comprising an amino acid
sequence of any one of SEQ ID NOs: 5-8, and

c) a polypeptide homologous to a polypeptide comprising amino acid sequence of
any one of SEQ ID NO: 5-8.

30. An antibody directed against the isolated polypeptide according to claim 29. 

31. The antibody according to claim 30, wherein the antibody comprises a 
detectable compound. 

32. A method of detecting a polypeptide, wherein the method comprises 
a) contacting the polypeptide with an antibody according to claim 31; and

b) detecting an antigen/antibody complex formed between the polypeptide and the
antibody.

33. A diagnostic kit for detecting a polypeptide, wherein the kit comprises
a) the antibody according to claim 31; and

b) a reagent allowing detection of an antigen/antibody complex formed between the
polypeptide and the antibody.

34. A pharmaceutical composition comprising the nucleic acid according to
claim 1 and a physiologically compatible excipient.

35. A pharmaceutical composition comprising the recombinant vector according
to claim 21 and a physiologically compatible excipient.

36. Use of the nucleic acid according to claim 1 for the manufacture of a
medicament intended for the prevention and/or treatment of a subject affected by a
dysfunction in the reverse transport of cholesterol.

37. Use of a recombinant vector according to claim 21 for the manufacture of a
medicament for the prevention and/or treatment of subjects affected by a dysfunction in the
lipophilic substance transport.

38. Use of any one of isolated ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides comprising an amino acid sequence of SEQ ID NO: 5-8 for the manufacture
of a medicament intended for the prevention and/or treatment of subjects affected by a
dysfunction in the lipophilic substance transport.

39. A pharmaceutical composition comprising a polypeptide comprising an
amino acid sequence of any one of SEQ ID NOs: 5-8, and a physiologically compatible
excipient.

40. Use of any one of isolated ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides comprising an amino acid sequence of any one of SEQ ID NOs: 5-8 for
screening an active ingredient for the prevention or treatment of a disease resulting from a
dysfunction in the lipophilic substance transport.

41. Use of a recombinant host cell expressing any one of the ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides comprising an amino acid sequence of SEQ ID NOs:
5-8 for screening an active ingredient for the prevention or treatment of a disease resulting
from a dysfunction in the lipophilic substance transport.

42. A method of screening a compound active on cholesterol metabolism, an
agonist, or an antagonist of any one of the ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, wherein the method comprises

a) preparing a membrane vesicle comprising at least one of the ABCA5, ABCA6,
ABCA9, and ABCA10 polypeptides and a lipid substrate comprising a detectable marker;

b) incubating the vesicle obtained in step a) with an agonist or antagonist candidate
compound;

c) qualitatively and/or quantitatively measuring a release of the lipid substrate
comprising the detectable marker; and

d) comparing the release of the lipid substrate measured in step b) with a
measurement of a release of a labeled lipid substrate by a membrane vesicle that has not
been previously incubated with the agonist or antagonist candidate compound.

43. A method of screening a compound active on cholesterol metabolism, an
agonist, or an antagonist of any one of ABCA5, ABCA6, ABCA9, and ABCA10
polypeptides, wherein the method comprises

a) incubating a cell that expresses at least one of the ABCA5, ABCA6, ABCA9,
and ABCA10 polypeptides with an anion labeled with a detectable marker;

b) washing the cell of step a) whereby excess labeled anion that has not penetrated
into the cell is removed;

c) incubating the cell obtained in step b) with an agonist or antagonist candidate
compound for any one of the ABCA5, ABCA6, ABCA9, and ABCA10 polypeptides;

d) measuring efflux of the labeled anion from the cell; and

e) comparing the efflux of the labeled anion determined in step d) with efflux of a
labeled anion measured with a cell that has not been previously incubated with the agonist
or antagonist candidate compound.

44. An implant comprising the recombinant host cell according to claim 23. 
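
The comparison step shared by the screening methods of claims 42 and 43 (release or efflux measured with a candidate compound versus the same measurement without it) can be sketched numerically. The Python sketch below is illustrative only and is not part of the claimed methods: the function names, the use of fractional efflux (label in the medium divided by total label), and the 10% tolerance used to call an effect are all assumptions introduced here.

```python
def fractional_efflux(medium_counts, cell_counts):
    """Fraction of total label found outside the cell or vesicle."""
    total = medium_counts + cell_counts
    if total <= 0:
        raise ValueError("no label detected")
    return medium_counts / total


def compare_efflux(treated, control, tolerance=0.10):
    """Compare efflux with and without a candidate compound.

    treated, control: (medium_counts, cell_counts) tuples.
    tolerance: hypothetical cutoff for calling a change; not from the source.
    Returns (ratio, call).
    """
    f_treated = fractional_efflux(*treated)
    f_control = fractional_efflux(*control)
    if f_control == 0:
        raise ValueError("control shows no efflux; ratio is undefined")
    ratio = f_treated / f_control
    if ratio > 1 + tolerance:
        call = "increased efflux (agonist-like)"
    elif ratio < 1 - tolerance:
        call = "decreased efflux (antagonist-like)"
    else:
        call = "no clear effect"
    return ratio, call


# Invented counts, for illustration only.
ratio_up, call_up = compare_efflux(treated=(500, 500), control=(250, 750))
ratio_down, call_down = compare_efflux(treated=(100, 900), control=(400, 600))
```

In practice the cutoff would be set from replicate measurements and appropriate statistics rather than a fixed tolerance.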

Figure 1: Cluster ABCA 5,6,9,10 chromosome 17

Figure 2

DIALIGN 2.0

Aligned sequences:      length:

1) ABC-A-1-1.pep        2261

2) ABC-A-5-2.pep        1641

3) ABC-A-6-3.pep        1617

4) ABC-A-8-1.pep        1581

5) ABC-A-9-3.pep        1624

Alignment (DIALIGN format):

ABC-A-1-1. 1 MAC

ABC-A-5-2. 1 mstalREVGV

ABC-A-6-3. 1 MN--MKQKSV

ABC-A-8-1. 1 mr--KRKISV

ABC-A-9-3. 1 MS--KRHMSV










****** 



** ********** ********** ********** **#* 



ABC -A-1 -1. 44 Isyppyeqhe chfpnkamps agtlpwvqgi ic3:iannpcfr yptPGEAPGV 

ABC-A-5-2. 51 mmhpnkk-YE EVPNI ^ 

ABC-A-6-3. 49 ssmmvqf PGMAPQN 

ABC-A-8-1. 49 hs-HQVNDFS SLLTM ° 

ABC-A-9-3. 49 snlHQVHDTP QMSSM ^ 

******* ***** ******* 



ABC-A-1-1. 94 VGNFNKsiva rlfsdarrll lysqkdtsmk dmrkvlrtlq qikksssnlk 

ABC-A-5-2. 66 LNPMDKFTLS NIi 

ABC-A-6-3. 64 LGRVDKFNSS SL 

ABC-A-8-1. 64 LGRVDTFNES RF 

ABC-A-9-3. 65 LGRVDSFNDT NY

********** ** 
********** ** 

ABC-A-1-1. 144 Iqdflvdnet fsgflyhnls Ipkstvdkna radvilhkvf Iqgyqlhlts 

ABC-A-5-2. 78 ~ 

ABC-A-6-3 . 76 

ABC-A-8-1. 76 

ABC-A-9-3 . 77 



ABC-A-1-1. 194 Icngskseem iqlgdqevse Icglprekla aaervlrsmn dilkpilrtl 

ABC-A-5-2. 78 

ABC-A-6-3. 76 311 

ABC-A-8-1. 76 

ABC-A-9-3. 77 

ABC-A-1-1. 244 nstspfpsJce laeatktllh slgtlaqelf smrswsdmrq evmfltnvns

ABC-A-5-2. 78 

ABC-A-6-3 . 76 

ABC-A-8-1. 76 

ABC-A-9-3. 77 



ABC-A-l-l. 294 sssstqiyqa vsrivcghpe ggglki3csln wyedrmykal fggngteeda 

ABC-A-5-2. 78

ABC-A-6-3. 76 

ABC-A-8-1. 76 

ABC-A-9-3. 77 



ABC-A-1-1. 344 etfydnsttp ycndlatale ssplsriiwk alkpllvgKI LYTPDTPATR 

ABC-A-5-2. 78 GYTPVTNITS 

ABC-A-6-3. 76 MVVYTPISNLTQ 

ABC-A-8-1. 76 -SVVYTPVTNTTQ 

ABC-A-9-3. 77 ^ AFAPESKTTQ 



ABC-A-1-1. 394 QVMAEVNKTf qelavfhdle gmweelspki wtfmensqem dlvrmlldsr 

ABC-A-5-2. 90 SIMQKVSTdh IpdviiteeY TNE 

ABC-A-6-3. 88 QIMNKTALAP LLKGtsVIGA PNK 

ABC-A-8-1, 88 QIMNKVASTP FLAGKEVLGIj PDE 

ABC-A-9-3. 89 EIMNKVASAP FLKGRTIMGW PDE 

********** ********** *** 
******** 



ABC-A-1-1. 444 dndhfweqql dgldwtaQDI VAFUVKHPED Vqssngsvyt wreafnetnq 

ABC-A-5-2. 113 KEM LTSSLSKPSN PVGWFKDSM SYEIiRFFpditl 

ABC-A-6-3! Ill THM DEIIiLENLPY AMGIIFNETF SYKLIFFQGY 

ABC-A-8-1! Ill ESI KEPTANYPEE IVRVTFTNTY SYHLKFLLGH 

ABC-A-9-3. 112 KSM DELDIJWSID AVRVIFTDTF SYHIiKFSWGH 

*** ********** ********** ********** 



ABC-A-1-1. 494 airtisrfme cvnlnklepi atevwlinks mellderkfw agivftgitp 

ABC-A'-5-2. 146 ipvssiymds ragcsk SO 

ABC-A-6-3. 144 NSPLWKE— D FSAHCWDGYG EFSC 

ABC-A-8-1. 144 OIPAKKEHKD HTAHCYETNE dvyC 

ABC-A-9-3. 145 RIPMMKEHRD HSAHCQAVNE KMKC 

********** ********** **** 



ABC-A-1-1. 544 gsielphhvk yklrmdldnv ertnkikdgy wdpgpradpf EEMRYVWGGP 



ABC-A-5-2. 164 EAAQYWSSGF

ABC-A-6-3. 166 TLTKYWNRGF

ABC-A-8-1. 168 EVSVFWKEGF

ABC-A-9-3. 169 EGSEFWEKQF



********** 
********** 



WO 02/46458    PCT/EP01/15401

ABC-A-l-l 594 AYLQDWEQA IIRVLTGtek k TGV YMCQMPYpcy vddiflrvms 

ABC-A-5-2 174 TVLQASIDAA IIQLKTNVSI* WKELEStlcAV IMGETAWEI DTFPRGVIIil 

ABC-A-6-3 176 VALQTAINTA IIEITTNHFV MEEIMSVTAI TMKTLPFITK NLLHNEMFIL 

ABC-A-8-l! 178 VALQAAINAA IIEITTNHSV MEELMSVTGk nMKMHSFIGQ SC5VITDLYLF 

ABC-A-9-3 * 179 VAPQAAINAA IIEIATNHSV MEQLMSVTGV HMKILPFVAQ GGVATDFFIF 

********** ********** ********** ********** ********** 
********** ********** ******** 



ABC-A-1-1. 638 rsinplfmtla wiysvaviik glVYEKEARL KETMRIMGLD NSILWFSWFI

ABC-A-5-2. 224 YDVIAFSPFG YFLAIH IVAEKEKKI KEFLKIMGLH DTAFWLSWVL

ABC-A-6-3. 226 FFLLHFSPLV YFISLN VTKERKKS KNLMKMMGIiQ DSAFWLSWGL

ABC-A-8-1. 228 SCIISFSSFI YYASVN VTRERKRM KALMTMMGLR DSAFWLSWGL

ABC-A-9-3. 229 FCIISFSTFI YYVSVN VTQERQYI TSLMTMMGLR ESAFWLSWGL



********** ****** 



********* ********** ********** 
******** ********** ********** 
******** ********** 



ABC-A-l-l. 688 SSLIPLLVSA GLLWILKLG NLLPYSDPSV VPVFLSVFAV VTILQCFLIS 

ABC-A-5-2. 269 LYTSLIFLMS LLMAVIATas ILFPQSSSIV IFLLFFLYGL SSVFFALMLT 

ABC-A-6-3. 270 lYAGFIFIIS IFITIIITFT QIIVMTGFMV IFILFFLYGL SLVALVFLMS 

ABC-A-B-1*. 272 LYAGFIFIMA LFLALVIRST QFIILSGFMV VFSLFLLYGL SLVALAFLMS 

ABC-A-9-3. 273 MYAGFILIMA TLMALIVKSA QIWLTGPVM VFTLFLLYGL SLITLAFLMS 

********** ********** **********. ********** 
********** ******** ♦* ********** ********** 



ABC-A-l-l. 738 TLFSRANLAA acggllYFTL YLPYVLCV-A WQDYVGFTLK IFASLLSPVA 

ABC-A-5-2 . 319 PLFKKSKHVG IV— EFFVTV af gf iglmil LIESFPKSLV WLFSPFCHCT 

ABC-A-6-3! 320 VLLKKAVLTN LV— VFLLTL FWGCLGFT-V FYEQLPSSLE WILNICSPFA 

ABC-A-8-1. 322 ILVKKSFLTG LV— VFLLTV FWGCLGFT-S LYRHLPASLE WILSLLSPFA 

ABC-A-9-3 . 323 VLIKKPFLTG LV— VFLLIV FWQILGFP-A LYTHLPAFLE WTLCLLSPPA 

********** ** ****** ******** * ********** ********** 
********** ** ****** ******** ****** ** ♦♦***♦***♦ 



ABC-A-l-l. 787 FGFGceyfal feeqgigvqw dnlfespvee DGFNLTTSVS MMLFDTFLYG 

ABC-A-5-2. 367 FVIGIAQVMH LEDFNEGASF SNLTA GPYPLIITII MLTLNSIFYV 

ABC-A-6-3. 367 FTTGMIQIIK LDYNLNGVIP PDPSG DSYTMIATFS MLLLDGLIYL 

ABC-A-8-1. 369 FMLGMAQLLH LDYDLNSNAF PHPSD GSNLIVATNF MLAFDTCLYL 

ABC-A-9-3. 370 FTVGMAQLIH LDYDVNSNAH LDSSQ NPYLIIATLF MLVFDTLLYL 

********** ********** ***** *#******** ********** 

********** ****** ******* ********** 



ABC-A-1-1. 837 VMTWYIEAVF PGQYGIPRPW YFPCTKSYWF geesdekshp gsngkrisei

ABC-A-5-2 412 LLAVYLDQVI PGEFGLRRSS LYFLKPSYWS KSKRNYEELS EGNVNQTisf 

ABC-A-6-3 412 LLALYFDKIL PygderhySP LPFLNSSSCF QHQRTNAKVI EKEIDAEHPS 

ABC-A-8-1 414 AIiAIYFEKIL PNEYGHRRPP LFFLKSSFWS QTQKTDHVAI. EDEMDADPSF 

ABC-A-9-3! 415 VLTLYFDKIIi PAEYGHRCSP LFFLKSCFWP QHGRANHWL ENBTDSDPTP 

********** ********** ********** ********** ********** 
********** ********** ********* 

ABC-A-1-1. 887 caneeepthlk lgvsi<5cilvk vyrdgmkvav d GLA liNFYEGQITS

ABC-A-5-2. 462 seiiEPVSSE FVGKEAIRIS GIQKTYRKKG ENVEALRNLS FDIYEGQITA 

ABC-A-6-3 . 462 DDYFEPVAPE FQGKEAIRIR NVKKEYKGKS GKVEALKGLL FDIYEGQITA 

ABC-A-8-1. 464 HDSPEQAPPE FQGKEAIRIR NVTKEYKGKP DKIEALKDIiV FDIYEGQITA 

ABC-A-9-3 . 465 NDCFBPVSPB FCGKEAIRIK NLKKEYAGKC ERVEALKGW FDIYEGQITA 

********** ********** ********** ********** ********** 
****** ********** ********** ********** ********** 
******** ********** ********** ********** 

********** 



ABC-A-l-l. 931 FLGHNGAGKT TTMSILTGLF PPTSGTAYIL Gkdir— SEM STIRQNLGVC 

ABC-A-5-2. 512 LLGHSGTGKS TLMNILCGLC PPSDGFASIY GHRVSEIDEM FEARKMIGIC 

ABC-A-6-3 . 512 ILGHSGAGKS SIiLNIIiNGLS VPTEGSVTIY NKNLSEMQDL EEIRKITGVC 

ABC-A-8-1. 514 ILGHSGAGKS TLLNILSGLS VPTKGSVTIY NNKLSEMADL ENLSKLTGVC 

ABC-A-9-3 . 515 LIiGHSQAGKT TLIiNIIiSGLS VPTSGSVTVY NHTLSRMADl ENISKPTGFC 

********** ********** ********** ********** ********** 

********** ********** ********** ********** ********** 

********** ********** ********** * * ****** 
********** ********** ****** 



ABC-A-l-l. 979 PQHNVtiFDMIi TVEEHIWFYA RLKGLSEKHV KAEMEQMald vglpssklks 

ABC-A-5-2. 562 PQLDIHFDVL TVEENLSILA SIKGIpanni iQEVQKVLLD LEMQTIKDNQ 

ABC-A-6-3 . 562 PQFNVQFDIL TVKENLSLFA KIKGIHLKEV EQEVQRILIiE IiDMQNIQDNL 

ABC-A-8-1. 564 PQSNVQFDFL TVRENLRLFA KIKQILPQEV : 

ABC-A-9-3. 565 PQSNVQFGFL TVKENLRLFA KIKGILPHBV 

********** ********** ********** ********** ********** 
********** ********** ***** 
********** ********** ***** 



ABC-A-l-l. 1029 ktsqLSG^Q R2CLSVALAFV GGSK 

ABC-A-5-2. 612 AKK-LSGGQK RKLSLGIAVL GNPK 

ABC-A-6-3. 612 AKH-LSEQQK RKLTFGITIL GDPQ 

ABC-A-8-1. 594 DKE 

ABC-A-9-3. 59S EKEvqrwq elemeniqdl latjnlsggqn 

*** ****** ********** **** 
*** 
* 

ABC-A-l-l. 1053 WILDE PTAGVDPYSR RGIWELLLKY RQGRTIILST 

ABC-A-5-2. 635 ILLLDE PTAGMDPCSR HIVWNLLKYR KANRVTVFST 

ABC-A-6-3. 635 ILLLDE PTTGLDPFSR DQVWSLLRER RADHVILFST 

ABC-A-8-1. 597 IFLLDE PTAGLDPFSR HQVWNLLKER KTDRVILFST 

ABC-A-9-3, 624 rkltfgiail gdpQtVLLLDE PTAGLDPLSR HRIWNLLKEG KSDRVILFST 



****** ********** ********** ********** 

****** ********** ********** ********** 

****** ********** ********** ********** 

**** ********** ******* ** 



ABC-A-l-l. 1089 HHMDEADVLG DRIAIISHGK LCCVGSSLFL KNQLGTGYYL TLvkkdveSS 

ABC-A-5-2. 671 HFMDEADILA DRKAVISQGM LKCVGSSMFL KSKWGIGYRL SMYHKYC— 

ABC-A-6-3. 671 QsMDEADILA DRKVIMSNGR LKCAGSSMFL KRRWGLGYHL SLHrNEIC— 

ABC-A-8-1. 633 QFMDEADILA DRKVFLSQGK IiKCAGSSLFI* KKKWGIGYHL SLQLNEIC — 

ABC-A-9-3 . 674 QPIDEADILA DRKVFXSNGK LKCAGSSIiPL KKKWGIGYHL SLHLNERC— 

********** ********** ********** ********** ******** 

********** ********** ********** ********** ****** * 

* ******** ********** ********** ********** ** 

******** ********** ********** ********** ** 

******* ********** ********** ******** 

ABC-A-1-1. 1139 Isscmssst vsylkkedsv sqsssdaglg sdhesdtlti dvsaisnlir

ABC-A-5-2. 719 ATESLSSLVK

ABC-A-6-3. 719 NPEQITSFIT

ABC-A-8-1. 681 VEENITSLVK

ABC-A-9-3. 722 DPESITSIiVK



#********* 
********** 



ABC-A-1-1, 1189 KHVSEARLVE DIGHELTYVL PYEAakegaf velfheidDR LSDLGISSYG 

ABc-A-5-2'. 729 QHIPGATLLQ QNDQQLVYSL PFKDMDKFSG LFSAL— DS HSNLGVISYG 

ABC-A-6-3. 729 HHIPDAKTiKT ENKEKLVYTL PLERTNTFPD LPSDL— DK CSDQGVTGYD 

ABC-A-8-1! 691 QHIPDAKLSA KSEGKLIYTL PLERTNKFPE LYKDL— DS YPDLGIENY6 

ABC-A-9-3. 732 QHISDAKLTA QSEEKLVYIIi PIiERTNKPPE LYRDL— DR CSNQGIEDYG 

********** ********** ********** ***** ** ********** 

^«jt******* ********** ********** ***** ** ********** 

* ****** 



ABC-A-1-1. 1239 ISETTLEEIF LKVAEESGVD AEtsdgtlpa rmrrafgdk qsclrpfted 

ABC-A-5-2. 776 VSMTTIjEDVF LKLEVEAEID QADYSVFTQQ PLEEEMDSKS FDEMEQSLLI 

ABC-A-6-3! 776 ISMSTIiNEVF MKLEGQSTIE QDf E QVEMIRDSES LNEMEIiAHSS 

ABC-A-8-1. 738 VSMTTIiNEVF LKLEGKSTIN ESDIAILGEV QAEKADDTER LVEMEQVLSS 

ABC-A-9-3. 779 VSITTLNEVF LKLEGKSTID ESDIGIWGQL QTDGAKDIGS LVELEQVLSS 

********** ********** ********** ********** ********** 
********** ********** 
********** ********* 



ABC-A-1-1. 1289 daadpndsdi dpesretdll sgmdgkgsyq VKGWKLTQQQ FVAIiLWKRLL

ABC-A-5-2. 826 LSETKASl VSTMSLWKQQ MYTIAKFHFP

ABC-A-6-3. 820 FSEMQTA VSISMGLWRMQ VPAMARIiRFL

ABC-A-8-1. 788 LNKMRKT IGGVALWRQQ ICAIARVRLL

ABC-A-9-3. 829 FHETRKT ISGVALWRQQ VCAIAKVRFL

******* ********** ********** 

***** ********** 




ABC-A-1-1. 1339 lARRSRKGFF AQIVLPAVFV CIALVFSLIv ppfgkypsle Iqpvmyaeqy 

ABC-A-5-2. 854 TLKRESKSVR SVLLUilFF TVQIfmflvh hsftaiawPI KLVPDLYFLK 

ABC-A-6-3! 847 KLKRQTKVLL TLIiLVFCIAI FPLIVENIIY amlnekid-W EFKNELYFLS 

;^C-A-8-l. 815 KLKHBRKALL ALLLILMAGF CPLLVEYTtlV KIYQNSYt-W ELSPH1»YPLA 

ABC-A-9-3! 856 KLKKERKSLW TILIiIiPGISP IPQLIiEHIiFY ESYQKSY-PW ELSPNTYFLS 

********** ********** ********** ******* ** ********** 
. * ********** 



ABC-A-1-1. 1389 tfvsndaped tgtlellnal tkcS^gfgtrc megnpipdtp cqageeewtt 

ABC-A-5-2. 904 PGDKP 

ABC-A-6-3. 896 PGQLP 

ABC-A-8-1. 864 PGQQP 

ABC-A-9-3. 905 PGQQP 



***** 
***** 

ABC-A-1-1. 1439 apvpqtimdl f^gnwtinqn pspacqcssd JciWanlpvpp pgagglpppq

ABC-A-5-2. 909 

ABC-A-6-3. 901 

ABC-A-8-1. 869 

ABC-A-9-3 . 910 



2^~A'X-X. 1489 RKQNTADILQ DLTGRNISDY LVKTYVQIlA KSLKNkiwvn efryggfslg 

ABC-A-5-2. 909 HKYKTSLLLQ NSADSDISDL ISFFTSQNIM VTMINdsdyv svap--HSAA 

;^C-A-6~3 . 901 QEPRTSLLIl NNTESNIEDF IKSLKHQNIL LEVDDFENRN GTDGLSYNGA 

ABC-A-8-1. 869 HDPLTQLIill NKTGASIDDF IQSVEHQNIA LEVDAFGTRN GTDDPSYNGA 

ABC-A-9-3. 910 QDPLTHLLVI NKTGSTIDNF LHSIjRRQNIA IEVDAFGTRN GTDDPSYNGA 

********** ********** ********** ********** ********** 
********** ********** ********** ********** ********** 



ABC-A-1-1. 1539 vsntqalpps qevndatkqjn kkhlklakds sadrflnslg rWMTGLDTBN 

ABC-A-5-2. 957 L NVMHSEKDY 

ABC-A-6-3. 951 I IVSGKQKDY 

ABC-A-8-1. 919 I TVCCNEKNY 

ABC-A-9-3. 960 I— - -IVSGDEKDH 

^ ********* 
4 ********* 



ABC-A-1-1. 1589 NVKVWFNNKG WHAISSFIiNV INNAILRanl qkgenpshyg itafnhplnl 

ABC-A-5-2. 967 VFAAVFNSTM VYSLPILVNI ISNYYLYHLN VTETIQIwst pffqeitdiv 

ABC-A-6-3. 961 RFSWCNTKR LHCFPILMNI ISNGLLQMFN HTQHIRIESS PFplshiglW 

ABC-A-8-1. 929 SFSLACNAKR UJCFFVLMDI VSNGIiIiGMVK PSVHIRTERS TPLEngqdnp 

ABC-A-9-3 . 970 RFSIACNTKR LNCFFVliLDV ISNGLLGIFN SSEHIQTDRS TPFEehrndfe^e 

********** ********** ********** ********** **** 
********** ********** ********** ********** ** 



ABC-A-1-1. 1639 tkqqlsevap mttsvdvlvs icvifamsfv pasfwfllQ ERVSKAKHLQ 

ABC-A-5-2. 1017 fkielyfqaa llgiivtamP PYFAMEN AS NHKIKAYTQL 

ABC-A-6-3. 1011 tGLPDGSFFL PLVLCSI— S PYITMGS IS DYKKNAKSQL 

ABC-A-8-1. 979 igflayimFW LVLTSSC— P PYIAMSS ID DYKNRARSQL 

ABC-A-9-3 . 1020 yGYRSNTPFW IPMAASF— T PYIAMSS IG DYKKKAHSQL 

********* ******* * ******* ** ********** 

********** 



ABC-A-1-1. 1689 PISGVKPVIY WLSNFVWDMC NYWPATLVI IIFICFQQKS Y-VSSTNLPV 

ABC-A-5-2. 1056 KLSQIiLPSAY WIGQAWDIP LFFIILILML GSLLAFHYGL Yfytvkfl-A 

ABC-A-6-3. 1048 WISQLYTSAY WCGOALVDVS FFILILLLMY LIFYIENMQY LLITSQIVFA 

ABC-A-8-1. 1016 RISGLSPSAY WFGQALVDVS IiYFLVFVFIY LMSYXSNFED MLLTIihiiq 

ABC-A-9-3 . 1057 RISGI.YPSAY WFGQALVDVS LYFLILLLMQ IMDYIFSPEE IIFIIqNLLI 

********** ********** ********** ********** ********** 
********** ********** ********* 

ABC-A-1-1. 1738 lALLLLLYGW SITPIMYPAS FVFKipstay wltsvnlfi gingsvatfv

ABC-A-5-2. 1105 WFCLIGYVP SVHiFTYIAS FTFKKIIiNTK EFWSFIYSVa alcIAITEIT 

ABC-A-6-3. 1098 LVJVTPGYAA SLVFFIYMIS FIPRKRRKNS GLWSFYFPfa stiinfsi— - 

ABC-A-8-1- 1066 ip-CAVGYSF SLIFMTYVIS PIFRKC3RKNS GIWSFCFW VTVFSVA 

2^C-A-9-3- 1107 QILCSIGYVS SLVFLTYVIS FIFRNGRKNS GIWSFFFLI WIFsiV 



********** ********** ********** ********* 
***« ********** ********** ******** 



ABC-A-1-1- 1788 lelftdnkln nindilksvf lifphfclgr glidiavlaiqa niadalerfge 

ABC-A-5-2. 1155 PFMGYTIATI LHYAFCIIIP lYPLLGCLIS FIKISwknvr knvdtynpwd 

ABC-A-6-3. 1145 -TLINHFDLS ILITTMVLVP SYTLLGfktf levrdqehyr efpeanfels 

ABC-A-S-l! 1111 GPAFSIFESD IPFIFTFLIP PAOMIGCLFIj SSHLlfsslf seermdvqp- 

ABC-A-9-3! 1153 aTDLNEYGFL GLFFGTMLIP PFTLIGSLFI FSEISpdsmd ylgaseseiv 



********** ********** ********** ***** 



ABC-A-1-1. 1838 nrfvsplswd Ivgrnlfama vegwfflit vliq YR FFIRP— RPV 

ABC-A-5-2. 1205 rls-VAVISP YLQCVLWIFL LQYYEKKYGQ RSIRKDPFFR nlstkskNRK 

ABC-A-6-3. 1194 atdFIiVCFIP YFQTLLFVFV LRCMELKCGK KRMRKDPVFR ISPQS— RDA 

ABC-A-8-1. 1160 FIjVFMP FI/HFIIFLFT LRCLEWKFGK KSMRKDPFFR ISPRS— SDV 

ABC-A-9-3. 1203 YLALLIP YLHFLIPLFI LRCLEMNCRK KLMRKDPVFR ISPRS— NAI 

******* ********** ********** ********** ***** *** 
** ********** ********** ******* ** ***** 



ABC-A-1-1. 1882 NAKIiSPLNDE DEDVRRERQR ILDGGGQNDI leikeltkiy 

ABC-A-5-2. 1254 LPEPPDNEDE DEDVKAERLK VKEtMGCQCC EEKPSIMVSN LHKEYDDKKD 

ABC-A-6-3 . 1242 KPNPEEPIDE DEDIQTERIR TATALTTSIL DEKPVIIASC LHKEYAGQKK 

ABC-A-8-1. 1205 CQNPEEPEGE DEDVQMERVR TANALNSTNF DEKPVIIASC LRKEYAGKRK 

ABC-A-9-3. 1248 FPWPBEPBGE EEDIQMBRMR TVNAMAVRDF DETPVIIASC LRKEYAGKKK 

********** ********** ********** ********** ********** 
*** ********** ***** ********** ********** 



ABC-A-1-1. 1922 RRKRKP AVDRICVGIP PGECFGLLGV NGAGKSSTFK MLTGDTTVTR

ABC-A-5-2. 1304 FLLSRKVKKV ATKYISFCVK KGEILGLLGP NGAGKSTIIN ILVGDIEPTS

ABC-A-6-3. 1292 SCFSKRKKKI AARNISFCVQ EGEILGLLGP SGAGKSSSIR MISGITKPTA

ABC-A-8-1. 1255 GCFSKRKNKI ATRNVSFCVR KGEVLGLLGH NQAGKSTSIK VITGDTKPTA

ABC-A-9-3. 1298 NCFSKRKKTI ATRNVSFCVK KGEVIGLLGH NGAGKSTTIK MITGDTKPTA

********** ********** ********** ********** **********

********** ********** ********** ********** **********

****** ********** ********** **********

********* ********** **********



ABC-A-1-1. 1968 GDAFLnmsi Isnihevhq- NMGYCPQFDA ITEIiTGREH VEFFALLRGV 

ABC-A-5-2. 1354 GQVFLgdysS etseddDSLK CMGYCPQINP LWPDTTLQEH FEIYGAVKGM 

ABC-A-6-3. 1342 GEVELKGcss v LG HLGYCPQENV LWPMLTIiREH LEVYAAVKGL 

ABC-A-8-1 1305 GQVLLKGSGG G DALE FLGYCPQENA LWPNLTVRQH XjEVYAAVKGL 

ABC-A-9-3. 1348 GQVIIiKGSGG G EPLG FLGYCPQENA WJPNLTVRQH LEVYAAVKGL 

********** * **** ********** ********** ********** 

******* ** ********** ********** ********** 

it**** ********* ********** ********** 

ffiiit ********* ********** ********** 

ABC-A-1-1. 2017 PEKEVGKVge wairklglvlc ygekyagnyS GGNKRKLSTA MALIGGPPW
ABC-A-5-2. 1404 SASISBSKEVIS RITHALDLKE HLQKTVKKLP AGIKRKIiCFA LSMLGNPQIT 
ABC-A-6-3. 1385 RKADARLAIA RLVSAFKLHE QLNVPVQKLT AGITRKLCFV LSLLGNSPVL 
ABC-A-8-1. 1350 RRGDAEVAIT RLVDALKLQD QLKSPVKTLS EGIKRKLCFV LSXLGNPSW 
ABC-A-9-3 . 1393 RKGDAmlAIT RLVDALKLQD QLKAPVKTLS EGIKRKLRFV LSILGWPSW 

********** ********** ********** ********** ********** 
***** **** ********** ********** ********** ********** 
ititmt * ********** ******* ** 



ABC-A-1-1. 2067 FLDEPTTGMD PK3iRRFUitoC alswkegrs -WLTSHSME ECEALCTRMA 

ABC-A-5-2. 1454 LIJ)EPSTGMD PKAKQHMWRA IRTAFKNBKR AAILTTHYME EAEAVCDRVA 

ABC-A-6-3. 1435 LLDEPSTGID PTGQQQMWQA IQAWKNTER GVLLTTHNLA EAEALCDRVA 

ABC-A-8-1. 1400 LLDEPSTGMD PEGQQQMWQA IRATFRNTER GALLTTHYMA EAEAVCDRVA 

ABC-A-9-3. 1443 LLDEPSTGMD PEGQQQMWQV IRATFRNTER GALLTTHYMA EAEAVCDRVA 

********** ********** ********** ********** ********** 
********** ********** ********** ********** ********** 
********** ********** ********** ********** ********** 
********** ******** ********* ********** 

***** ********** 



ABC-A-1-1. 2116 IMVNGRPRCL GSVQHLKNRF GDGYtiwri ag—SNPDLK PVQDFFGLAF 

ABC-A-5-2. 1504 IMVSGQLRCI GTVQHLKSKF GKGYFLEIKL kdwxENLEVD RLQREIQYIF 

ABC-A-6-3 . 1485 IMVSGRLRCI GSIQHLKNKL GKDYILELKV K ETSQVT LVHTEILKLF 

ABC-A-8-1. 1450 IMVSGRLRCI GSIQHLKSKF GKDYLLEMKV KNLA— QVE PLHAEILRLF 

ABC-A-9"-3 . 1493 IMVSGRLRCI GSIQHLKSKF GKDYLLEMKL KNLA— QME PLHAEILRLF 

********** ********** ********** ********** ********** 
********** ********** ********** * *** ********** 

********** ********** ********** 
********** ********** ********* 
********** ********** * 



ABC-A-1-1. 2164 PGSVPKBKHR NMLQYQLPSS Iss-LARIFS ILSQSKKRLH lEDYSVSQTT 

ABC-A-5-2. 1554 PNASRQESFS SILAYKIPKE DVQbLSQSFF KLEEAKHAFA IEEYSFSQAT 

ABC-A-6-3. 1532 PQAAGQERYS SLLTYKLPVA DVYPLSQTFH KLEAVKHNFN LEEYSLSQCT 

ABC- A- 8-1. 1497 PQAARQERYS SLMVYKLPVE DVQPLAQAFF KLEKVKQSFD LEEYSLSQST ' 

ABC-A-9-3. 1540 PQAAQQERFS SLMVYKLPVE DVRPLSQAFF KLEIVKQSFD LEEYSLSQST 

********** ********** ********** ********** ********** 

********** ********** ********** ********** ********** 

****** ********** ********** 

** ** ********* 



ABC-A-1-1. 2213 LDQVFVNFAK DQSDDDHlkd Islhknqtw dvavltsflq delcvkesyv 

ABC-A-5-2, 1604 LEQVFVELTK EQEEEDNscg tlnstlwwer tqedrwf — 

ABC-A-6-3 . 1582 LEKVFLELSK EQEVGNFDEE IDTTMRWKLL Phsdep 

ABC-A-8-1. 1547 LEQVFLELSK EQELGDFEED FDPSVKWKLL PQEEP 

ABC-A-9-3. 1590 LEQVFLELSK EQELGDLEED FDPSVKWKLL LQEEP 

********** ********** ********** ***** 
********** ********** ********** * 
********** *** 
********** ** 

FIGURE 6

FIGURE 7

FIGURE 9

FIGURE 10

FIGURE 11

FIGURE 12

FIGURE 13

FIGURE 14

FIGURE 15

FIGURE 16

FIGURE 17

SEQUENCE LISTING

<110> AVENTIS PHARMA

<120> NUCLEIC ACIDS OF THE HUMAN ABCA5, ABCA6, ABCA9, ABCA10 GENES,
VECTORS CONTAINING SUCH NUCLEIC ACIDS AND USES THEREOF

<130> ABCA5, 6, 9, 10

<140> ST00034
<141> 2000-12-06

<160> 217

<170> PatentIn Ver. 2.1

<210> 1
<211> 6525

<212> DNA

<213> Homo sapiens 

<400> 1 

aaaatgttga tattttctct tagcaggctg tcaaccaggt taggttcagg tcataagttt 60 
ctacccacat tctttgaact gtagttgtca ttttagttta tttttcaaaa acttttgcag 120 
tacctttttg gtctgtcttg tgtgtgcctt gcagtgaaca gtctggattt ggacagtggt 180 
ctgtctgtta gttcagtttc tcaagccttt gtcacactaa taggattgga tttatgtatg 240 
tccagcttgg gaattattac aggaattaaa aacaactttt tagagtgctt tcctgagctc 300
tctttctatt tgttccccct tctacttttt gcttccctgt ggctgctgtt tctatcctcc 360
agccagagag ctagtgttta ttttctccat tgtgttacac acttgtgcag ctgcaaccac 420 
catatccagg gcccaatggt aggaggtaga gaagaaaagc aaaagggatt ggcctcatcc 480 
tcttacaacg atagttccat tgaatagaga gaaaggtttt cctgcctcag agtgttggct 540 
gcactaggct tttgttactg tagtctggcc ctgttaccat gggattgctt gcatgtgggg 600 
atacaggaga attcagaaaa gaaaaaaaga tttgctattt ctacattctc cctgagcatt 660 
aagacctccc ttgcccattc ctcaattcaa agctaaggct tcttctggag ctgcctctgt 720 
gggcggttcg ggagatacca aaggagaaaa agtaccactg ttgatatggt ggtatttcaa 780 
attctggtct accctatttc acatgccttg tttacttttc agagctgaca gattgctgct 840
ccatgcattc tgtccagttt cctaagagag acagcttgga gtatgcttaa tccatcttac 900 
ctgggactga aacagctgct tattttgccg ttaaaaatta catgcagttt actgcgtggc 960 
tccgggtttg tttgtttgtt tttcctcttt aataggttta ttcagaaaac atgtccactg 1020 
caattaggga ggtaggagtt tggagacaga ccagaacact tctactgaag aattacttaa 1080 
ttaaatgcag aaccaaaaag agtagtgttc aggaaattct ttttccacta ttttttttat 1140 
tttggttaat attaattagc atgatgcatc caaataagaa atatgaagaa gtgcctaata 1200 
tagaactcaa tcctatggac aagtttactc tttctaatct aattcttgga tatactccag 1260 
tgactaatat tacaagcagc atcatgcaga aagtgtctac tgatcatcta cctgatgtca 1320 
taattactga agaatataca aatgaaaaag aaatgttaac atccagtctc tctaagccga 1380 
gcaactttgt aggtgtggtt ttcaaagact ccatgtccta tgaacttcgt ttttttcctg 1440 
atatgattcc agtatcttct atttatatgg attcaagagc tggctgttca aaatcatgtg 1500 
aggctgctca gtactggtcc tcaggtttca cagttttaca agcatccata gatgctgcca 1560 
ttatacagtt gaagaccaat gtttctcttt ggaaggagct ggagtcaact aaagctgtta 1620 
ttatgggaga aactgctgtt gtagaaatag atacctttcc ccgaggagta attttaatat 1680 
acctagttat agcattttca ccttttggat actttttggc aattcatatc gtagcagaaa 1740 
aagaaaaaaa aataaaagaa tttttaaaga taatgggact tcatgatact gccttttggc 1800 
tttcctgggt tcttctatat acaagtttaa tttttcttat gtcccttctt atggcagtca 1860 
ttgcgacagc ttctttgtta tttcctcaaa gtagcagcat tgtgatattt ctgctttttt 1920 
tcctttatgg attatcatct gtattttttg ctttaatgct gacacctctt tttaaaaaat 1980 
caaaacatgt gggaatagtt gaattttttg ttactgtggc ttttggattt attggcctta 2040 
tgataatcct catagaaagt tttcccaaat cgttagtgtg gcttttcagt cctttctgtc 2100 
actgtacttt tgtgattggt attgcacagg tcatgcattt agaagatttt aatgaaggtg 2160 
cttcattttc aaatttgact gcaggcccat atcctctaat tattacaatt atcatgctca 2220
cacttaatag tatattctat gtcctcttgg ctgtctatct tgatcaagtc attccagggg 2280 
aatttggctt acggagatca tctttatatt ttctgaagcc ttcatattgg tcaaagagta 2340 
aaagaaatta tgaggagtta tcagagggca atgttaatgg aaatattagt tttagtgaaa 2400 
ttattgagcc agtttcttca gaatttgtag gaaaagaagc cataagaatt agtggtattc 2460 
agaagacata cagaaagaag ggtgaaaatg tggaggcttt gagaaatttg tcatttgaca 2520 
tatatgaggg tcagattact gccttacttg gccacagtgg aacaggaaag agtacattga 2580 
tgaatattct ttgtggactc tgcccacctt ctgatgggtt tgcatctata tatggacaca 2640 
gagtctcaga aatagatgaa atgtttgaag caagaaaaat gattggcatt tgtccacagt 2700 
tagatataca ctttgatgtt ttgacagtag aagaaaattt atcaattttg gcttcaatca 2760 
aagggatacc agccaacaat ataatacaag aagtgcagaa ggttttacta gatttagaca 2820 
tgcagactat caaagataac caagctaaaa aattaagtgg tggtcaaaaa agaaagctgt 2880 
cattaggaat tgctgttctt gggaacccaa agatactgct gctagatgaa ccaacagctg 2940 
gaatggaccc ctgttctcga catattgtat ggaatctttt aaaatacaga aaagccaatc 3000 
gggtgacagt gttcagtact catttcatgg atgaagctga cattcttgca gataggaaag 3060 
ctgtgatatc acaaggaatg ctgaaatgtg ttggttcttc aatgttcctc aaaagtaaat 3120 
SFSrSrgsratcgg ctaccgcctg agcatgtaca tagacaaata ttgtgccaca gaatctcttt 3180 
cttcactggt taaacaacat atacctggag ctactttatt acaacagaat gaccaacaac 3240 
ttgtgtatag cttgcctttc aaggacatgg acaaattttc aggtttgttt tctgccctag 3300 
acagtcattc aaatttgggt gtcatttctt atggtgtttc catgacgact ttggaagacg 3360 
tatttttaaa gctagaagtt gaagcagaaa ttgaccaagc agattatagt gtatttactc 3420 
agcagccact ggaggaagaa atggattcaa aatcttttga tgaaatggaa cagagcttac 3480 
ttattctttc tgaaaccaag gcttctctag tgagcaccat gagcctttgg aaacaacaga 3540 
tgtatacaat agcaaagttt catttcttta ccttgaaacg tgaaagtaaa tcagtgagat 3600 
cagtgttgct tctgctttta atttttttca cagttcagat ttttatgttt ttggttcatc 3660 
actcttttaa aaatgctgtg gttcccatca aacttgttcc agacttatat tttctaaaac 3720 
ctggagacaa accacataaa tacaaaacaa gtctgcttct tcaaaattct gctgactcag 3780 
atatcagtga tcttattagc tttttcacaa gccagaacat aatggtgacg atgattaatg 3840 
acagtgacta tgtatccgtg gctccccata gtgcggcttt aaatgtgatg cattcagaaa 3900 
aggactatgt ttttgcagct gttttcaaca gtactatggt ttattcttta cctatattag 3960 
tgaatatcat tagtaactac tatctttatc atttaaatgt gactgaaacc atccagatct 4020 
ggagtacccc attctttcaa gaaattactg atatagtttt taaaattgag ctgtattttc 4080 
aagcagcttt gcttggaatc attgttactg caatgccacc ttactttgcc atggaaaatg 4140 
cagagaatca taagatcaaa gcttatactc aacttaaact ttcaggtctt ttgccatctg 4200 
catattggat tggacaagct gttgttgata tccccttatt ttttatcatt cttattttga 4260 
tgctaggaag cttactggca tttcattatg gattatattt ttatactgta aagttccttg 4320 
ctgtggtttt ttgccttatt ggttatgttc catcagttat tctgttcact tatattgctt 4380 
ctttcacctt taagaaaatt ttaaatacca aagaattttg gtcatttatc tattctgtgg 4440 
cagcgttgnc ttgtattgca atcactgaaa taactttctt tatgggatac acaattgcaa 4500 
ctattcttca ttatgccttt tgtatcatca ttccaatcta tccacttcta ggttgcctga 4560 
tttctttcat aaagatttct tggaagaatg tacgaaaaaa tgtggacacc tataatccat 4620 
gggataggct ttcagtagct gttatatcgc cttacctgca gtgtgtactg tggattttcc 4680 
tcttacaata ctatgagaaa aaatatggag gcagatcaat aagaaaagat ccctttttca 4740 
gaaacctttc aacgaagtct aaaaatagga agcttccaga accaccagac aatgaggatg 4800 
aagatgaaga tgtcaaagct gaaagactaa aggtcaaaga gctgatgggt tgccagtgtt 4860 
gtgaiggagaa accatccatt atggtcagca atttgcataa agaatatgat gacaagaaag 4920 
attttcttct ttcaagaaaa gtaaagaaag tggcaactaa atacatctct ttctgtgtga 4980 
aaaaaggaga gatcttagga ctattgggtc caaatggtgc tggcaaaagc acaattatta 5040 
atattctggt tggtgatatt gaaccaactt caggccaggt atttttagga gattattctt 5100 
cagagacaag tgaagatgat gattcactga agtgtatggg ttactgtcct cagataaacc 5160 
ctttgtggcc agatactaca ttgcaggaac attttgaaat ttatggagct gtcaaaggaa 5220 
tgagtgcaag tgacatgaaa gaagtcataa gtcgaataac acatgcactt gatttaaaag 5280 
aacatcttca gaagactgta aagaaactac ctgcaggaat caaacgaaag ttgtgttttg 5340 
ctctaagtat gctagggaat cctcagatta ctttgctaga tgaaccatct acaggtatgg 5400 
atcccaaagc caaacagcac atgtggcgag caattcgaac tgcatttaaa aacagaaagc 5460 
gggctgctat tctgaccact cactatatgg aggaggcaga ggctgtctgt gatcgagtag 5520 
ctatcatggt gtctgggcag ttaagatgta tcggaacagt acaacatcta aagagtaaat 5580 
ttggaaaagg ctactttttg gaaattaaat tgaaggactg gatagaaaac ctagaagtag 5640 
accgccttca aagagaaatt cagtatattt tcccaaatgc aagccgtcag gaaagttttt 5700
cttctatttt ggcttataaa attcctaagg aagatgttca gtccctttca caatcttttt 5760 
ttaagctgga agaagctaaa catgcttttg ccattgaaga atatagcttt tctcaagcaa 5820 
cattggaaca ggtttttgta gaactcacta aagaacaaga ggaggaagat aatagttgtg 5880 
gaactttaaa cagcacactt tggtgggaac gaacacaaga agatagagta gtattttgaa 5940 
tttgtattgt tcggtctgct tactgggact tctttctttt tcacttaatt ttaactttgg 6000 
tttaaaaagt tttttattgg aatggtaact ggagaaccaa gaacgcactt gaaatttttc 6060 
taagctcctt aattgaaatg ctgtggttgt gtgttttgct tttctttaaa taaaacgtat 6120 
gtataattaa gtgaagctgc atgtttgtat tgaagtatat tgaactatat agtttgtatg 6180 
tcatcttttt caccattcag aaacagtgct tctgaatttg tgatttaaag gaattgtaat 6240 
agaatagttt tatttttaag ttatctttaa gtttatgcca tcttcttaaa taagtacgta 6300 
atgttccaat ctaaataaaa aactaataca taactaatgc atagaaaaga tacataaagc 6360 
aatgtgaaag tttcttgctt ctccttttta atttctaaaa aagccacttt gaatggaagt 6420 
tgtcatccgt aaaagctgaa gtgtaagcac taggaaatct caatatagag atttgaggaa 6480 
agttatatcc actaggtggc agtcattgat cataataagt gaaat 6525 
<210> 2
<211> 5296

<212> DNA

<213> Homo sapiens 

<400> 2 

ctgctggagt aggcacccat ttaaagaaaa aatgaagaag cagcaataaa gaagttgtaa 60
tcgttaccta gacaaacaga gaactggttt tgacagtgtt tctagagtgc tttttattat 120
tttcctgaca gttgtgttcc accatgatta ctttctcctt cagcgaatag gctaaatgaa 180
tatgaaacag aaaagcgtgt atcagcaaac caaagcactt ctgtgcaaga attttcttaa 240
gaaatggagg atgaaaagag agagcttatt ggaatggggc ctctcaatac ttctaggact 300
gtgtattgct ctgttttcca gttccatgag aaatgtccag tttcctggaa tggctcctca 360
gaatctggga agggtagata aatttaatag ctcttcttta atggttgtgt atacaccaat 420
atctaattta acccagcaga taatgaataa aacagcactt gctcctcttt tgaaaggaac 480
aagtgtcatt ggggcaccaa ataaaacaca catggacgaa atacttctgg aaaatttacc 540
atatgctatg ggaatcatct ttaatgaaac tttctcttat aagttaatat ttttccaggg 600
atataacagt ccactttgga aagaagattt ctcagctcat tgctgggatg gatatggtga 660
gttttcatgt acattgacca aatactggaa tagaggattt gtggctttac aaacagctat 720
taatactgcc attatagaaa tcacaaccaa tcaccctgtg atggaggagt tgatgtcagt 780
tactgctata actatgaaga cattaccttt cataactaaa aatcttcttc acaatgagat 840
gtttatttta ttcttcttgc ttcatttctc cccacttgta tattttatat cactcaatgt 900
aacaaaagag agaaaaaagt ctaagaattt gatgaaaatg atgggtctcc aagattcagc 960
attctggctc tcctggggtc taatctatgc tggcttcatc tttattattt ccatattcat 1020
tacaattatc ataacattca cccaaattat agtcatgact ggcttcatgg tcatatttat 1080
actctttttt ttatatggct tatctttggt agctttggtg ttcctgatga gtgtgctgtt 1140
aaagaaagct gtcctcacca atttggttgt gtttctcctt accctctttt ggggatgtct 1200
gggattcact gtattttatg aacaacttcc ttcatctctg gagtggattt tgaatatttg 1260
tagccctttt gcctttacta ctggaatgat tcagattatc aaactggatt ataacttgaa 1320
tggtgtaatt tttcctgaca cttcaggaga ctcatataca atgatagcaa ctttttctat 1380
gttgcttttg gatggtctca tctacttgct attggcatta tactttgaca aaattttacc 1440
ctatggagat gagcgccatt attctccttt atttttcttg aattcatcat cttgtttcca 1500
acaccaaagg actaatgcta aggttattga gaaagaaatc gatgctgagc atccctctga 1560
tgattatttt gaaccagtag ctcctgaatt ccaaggaaaa gaagccatca gaatcagaaa 1620
tgttaagaag gaatataaag gaaaatctgg aaaagtggaa gcattgaaag gcttgctctt 1680
tgacatatat gaaggtcaaa tcacggcaat cctgggtcac agtggagctg gcaaatcttc 1740
actgctaaat attcttaatg gattgtctgt tccaacagaa ggatcagtta ccatctataa 1800
taaaaatctc tctgaaatgc aagacttgga ggaaatcaga aagataactg gcgtctgtcc 1860
tcaattcaat gttcaatttg acatactcac cgtgaaggaa aacctcagcc tgtttgctaa 1920
aataaaaggg attcatctaa aggaagtgga acaagaggta caacgaatat tattggaatt 1980
ggacatgcaa aacattcaag ataaccttgc taaacattta agtgaaggac agaaaagaaa 2040
gctgactttt gggattacca ttttaggaga tcctcaaatt ttgcttttag atgaaccaac 2100
tactggattg gatccctttt ccagagatca agtgtggagc ctcctgagag agcgtagagc 2160
agatcatgtg atccttttca gtacccagtc catggatgag gctgacatcc tggctgatag 2220
aaaagtgatc atgtccaatg ggagactgaa gtgtgcaggt tcttctatgt ttttgaaaag 2280
aaggtggggt cttggatatc acctaagttt acataggaat gaaatatgta acccagaaca 2340
aataacatcc ttcattactc atcacatccc cgatgctaaa ttaaaaacag aaaacaaaga 2400
aaagcttgta tatactttgc cactggaaag gacaaataca tttccagatc ttttcagtga 2460
tctggataag tgttctgacc agggagtgac aggttatgac atttccatgt caactctaaa 2520
tgaagtcttt atgaaactgg aaggacagtc aactatcgaa caagatttcg aacaagtgga 2580
gatgataaga gactcagaaa gcctcaatga aatggagctg gctcactctt ccttctctga 2640
aatgcagaca gctgtgagtg acatgggcct ctggagaatg caagtctttg ccatggcacg 2700
gctccgtttc ttaaagttaa aacgtcaaac taaagtgtta ttgaccctat tattggtatt 2760
tggaatcgca atattccctt tgattgttga aaatataata tatgctatgt taaatgaaaa 2820
gatcgattgg gaatttaaaa acgaattgta ttttctctct cctggacaac ttccccagga 2880
accccgtacc agcctgttga tcatcaataa cacagaatca aatattgaag attttataaa 2940
atcactgaag catcaaaata tacttttgga agtagatgac tttgaaaaca gaaatggtac 3000
tgatggcctc tcatacaatg gagctatcat agtttctggt aaacaaaagg attatagatt 3060
ttcagttgtg tgtaatacca agagattgca ctgttttcca attcttatga atattatcag 3120
caatgggcta cttcaaatgt ttaatcacac acaacatatt cgaattgagt caagcccatt 3180
tcctcttagc cacataggac tctggactgg gttgccggat ggttcctttt tcttattttt 3240
ggttctatgt agcatttctc cttatatcac catgggcagc atcagtgatt acaagaaaaa 3300
tgctaagtcc cagctatgga tttcaggcct ctacacttct gcttactggt gtgggcaggc 3360
actagtggac gtcagcttct tcattttaat tctcctttta atgtatttaa ttttctacat 3420
agaaaacatg cagtaccttc ttattacaag ccaaattgtg tttgctttgg ttatagttac 3480
tcctggttat gcagcttctc ttgtcttctt catatatatg atatcattta tttttcgcaa 3540
aaggagaaaa aacagtggcc tttggtcatt ttacttcttt tttgcctcca ccatcatgtt 3600
ttccatcact ttaatcaatc attttgacct aagtatattg attaccacca tggtattggt 3660
tccttcatat accttgcttg gatttaaaac ttttttggaa gtgagagacc aggagcacta 3720
cagagaattt ccagaggcaa attttgaatt gagtgccact gattttctag tctgcttcat 3780
accctacttt cagactttgc tattcgtttt tgttctaaga tgcatggaac taaaatgtgg 3840
aaagaaaaga atgcgaaaag atcctgtttt cagaatttcc ccccaaagta gagatgctaa 3900
gccaaatcca gaagaaccca tagatgaaga tgaagatatt caaacagaaa gaataagaac 3960
agccactgct ctgaccactt caatcttaga tgagaaacct gttataattg ccagctgtct 4020
acacaaagaa tatgcaggcc agaagaaaag ttgcttttca aagaggaaga agaaaatagc 4080
agcaagaaat atctctttct gtgttcaaga aggtgaaatt ttgggattgc taggacccag 4140
fcggtgctgga aaaagttcat ctattagaat gatatctggg atcacaaagc caactgctgg 4200
agaggtggaa ctgaaaggct gcagttcagt tttgggccac ctggggtact gccctcaaga 4260
gaacgtgctg tggcccatgc tgacgttgag ggaacacctg gaggtgtatg ctgccgtcaa 4320
ggggctcagg aaagcggacg cgaggctcgc catcgcaaga ttagtgagtg ctttcaaact 4380
gcatgagcag ctgaatgttc ctgtgcagaa attaacagca ggaatcacga gaaagttgtg 4440
ttttgtgctg agcctcctgg gaaactcacc tgtcttgctc ctggatgaac catctacggg 4500
catagacccg acagggcagc agcaaatgtg gcaggcaatc caggcagtcg ttaaaaacac 4560
agagagaggt gtcctcctga ccacccataa cctggctgag gcggaagcct tgtgtgaccg 4620
tgtggccatc atggtgtctg gaaggcttag atgcattggc tccatccaac acctgaaaaa 4680
caaacttggc aaggattaca ttctagagct aaaagtgaag gaaacgtctc aagtgacttt 4740
ggtccacact gagattctga agcttttccc acaggctgca gggcaggaaa ggtattcctc 4800
tttgttaacc tataagctgc ccgtggcaga cgtttaccct ctatcacaga cctttcacaa 4860
attagaagca gtgaagcata actttaacct ggaagaatac agcctttctc agtgcacact 4920
ggagaaggta ttcttagagc tttctaaaga S-caggaagta ggaaattttg afcgaagaaat 4980
tgatacaaca atgagatgga aactcctccc tcattcagat gaaccttaaa acctcaaacc 5040
tagtaatttt ttgttgatct cctataaact tatgttttat gtaataatta atagtatgtt 5100
taattttaaa gatcatttaa aattaacatc aggtatattt tgtaaattta gttaacaaat 5160
acataaattt taaaattatt cttcctctca aacatagggg tgatagcaaa cctgtgataa 5220
aggcaataca aaatattagt aaagtcaccc aaagagtcag gcactgggta ttgtggaaat 5280
aaaactatat aaactt 5296



<210> 3 
<211> 5981 
<400> 3

attcacaatg aatgtgaaat taaaagcatg atgtagtagt gacccaaaag gaatgtgaat 60 
tctcctccag aacatgcaga gacccatgga tgaactgtgt ttctagattt ttcctccagc 120 
tttcctgaga gaaacaggtc aaaatgagca agagacgcat gagcgtgggt cagcaaacat 180 
gggctcttct ctgcaagaac tgtctcaaaa aatggagaat gaaaagacag accttgttgg 240 
aatggctctt ttcatttctt ctggtactgt ttctgtacct atttttctcc aatttacatc 300 
aagttcatga cactcctcaa atgtcttcaa tggatctggg acgtgtagat agttttaatg 360 
atactaatta tgttattgca tttgcacctg aatccaaaac tacccaagag ataatgaaca 420
aagtggcttc agccccattc ctaaaaggaa gaacaatcat ggggtggcct gatgaaaaaa 480 
gcatggatga attggatttg aactattcaa tagacgcagt gagagtcatc tttactgata 540 
ccttctccta ccatttgaag ttttcttggg gacatagaat ccccatgatg aaagagcaca 600 
gagaccattc agctcactgt caagcagtga atgaaaaaat gaagtgtgaa ggttcagagt 660 
tctgggagaa aggctttgta gcttttcaag ctgccattaa tgctgctatc atagaaatcg 720 
caacaaatca ttcagtgatg gaacagctga tgtcagttac tggtgtacat atgaagatat 780 
taccttttgt tgcccaagga ggagttgcaa ctgatttttt cattttcttt tgcattattt 840 
ctttttctac atttatatac tatgtatcag tcaatgttac acaagaaaga caatacatta 900 
cgtcattgat gacaatgatg ggactccgag agtcagcatt ctggctttcc tggggtttga 960
tgtatgctgg cttcatcctt atcatggcca ctttaatggc tcttattgta aaatctgcac 1020 
aaattgtcgt cctgactggt tttgtgatgg tcttcaccct ctttctcctc tatggcctgt 1080 
ctttgataac tttagctttc ctgatgagtg tgttgataaa gaaacctttc cttacgggct 1140 
tggttgtgtt tctccttatt gtcttttggg ggatcctggg attcccagca ttgtatacac 1200 
atcttcctgc atttttggaa tggactttgt gtcttcttag cccctttgcc ttcactgttg 1260 
ggatggccca gcttatacat ttggactatg atgtgaattc taatgcccac ttggattctt 1320 
cacaaaatcc atacctcata atagctactc ttttcatgtt ggtttttgac acccttctgt 1380 
atttggtatt gacattatat tttgacaaaa ttttgcccgc tgaatatgga catcgatgtt 1440 
ctcccttgtt tttcctgaaa tcctgttttt ggtttcaaca cggaagggct aatcatgtgg 1500 
tccttgagaa tgaaacagat tctgatccta cacctaatga ctgttttgaa ccagtgtctc 1560 
cagaattctg tgggaaggaa gccatcagaa tcaaaaatct taaaaaagaa tatgcaggga 1620 
agtgtgagag agtagaagct ttgaaaggtg tggtgtttga catatatgaa ggccagatca 1680 
ctgccctcct tggtcacagt ggagctggaa aaactaccct gttaaacata cttagtgggt 1740 
tgtcagttcc aacatcaggt tcagtcactg tctataatca cacactttca agaatggctg 1800 
atatagaaaa tatcagcaag ttcactggat tttgtccaca atccaatgtg caatttggat 1860 
ttctcactgt gaaagaaaac ctcaggctgt ttgctaaaat aaaagggatt ttgccacatg 1920 
aagtggagaa agaggtacaa cgagttgtac aggaattaga aatggaaaat attcaagaca 1980 
tccttgctca aaacttaagt ggtggacaaa ataggaaact aacttttggg attgccattt 2040 
taggagatcc tcaagttttg ctattggatg aaccgactgc tggattggat cctctttcaa 2100 
ggcaccgaat atggaatctc ctgaaagagg ggaaatcaga cagagtaatt ctcttcagca 2160 
cccagtttat agatgaggct gacattctgg cggacaggaa ggtgttcata tccaatggga 2220 
agctgaagtg tgcaggctct tctctgttcc ttaagaagaa atggggcata ggctaccatt 2280 
taagtttgca tctgaatgaa aggtgtgatc cagagagtat aacatcactg gttaagcagc 2340 
acatctctga tgccaaattg acagcacaaa gtgaagaaaa acttgtatat attttgcctt 2400 
tggaaaggac aaacaaattt ccagaacttt acagggatct tgatagatgt tctaaccaag 2460 
gcattgagga ttatggtgtt tccataacaa ctttgaatga ggtgtttctg aaattagaag 2520 
gaaaatcaac tattgatgaa tcagatattg gaatttgggg acaattacaa actgatgggg 2580 
caaaagatat aggaagcctt gttgagctgg aacaagtttt gtcttccttc cacgaaacaa 2640 
ggaaaacaat cagtggcgtg gcgctctgga ggcagcaggt ctgtgcaata gcaaaagttc 2700 
gcttcctaaa gttaaagaaa gaaagaaaaa gcctgtggac tatattattg ctttttggta 2760 
ttagctttat ccctcaactt ttggaacatc tattctacga gtcatatcag aaaagttacc 2820 
cgtgggaact gtctccaaat acatacttcc tctcaccagg acaacaacca caggatcctc 2880 
tgacccattt actggtcatc aataagacag ggtcaaccat tgataacttt ttacattcac 2940 
tgaggcgaca gaacatagct atagaagtgg atgcctttgg aactagaaat ggcacagatg 3000 
acccatctta caatggtgct atcattgtgt caggtgatga aaaggatcac agattttcaa 3060 
tagcatgtaa tacaaaacgg ctgaattgct ttcctgtcct cctggatgtc attagcaatg 3120 
gactacttgg aatttttaat tcgtcagaac acattcagac tgacagaagc acattttttg 3180 
aagagcatat ggattatgag tatgggtacc gaagtaacac cttcttctgg ataccgatgg 3240 
cagcctcttt cactccatac attgcaatga gcagcattgg tgactacaaa aaaaaagctc 3300
attcccagct acggatttca ggcctctacc cttctgcata ctggtttggc caagcactgg 3360 
tggatgtttc cctgtacttt ttgatcctcc tgctaatgca aataatggat tatattttta 3420 
gcccagagga gattatattt ataattcaaa acctgttaat tcaaatcctg tgtagtattg 3480 
gctatgtctc atctcttgtt ttcttgacat atgtgatttc attcattttt cgcaatggga 3540 
gaaaaaatag tggcatttgg tcatttttct tcttaattgt ggtcatcttc tcgatagttg 3600 
ctactgatct aaatgaatat ggatttctag ggctattttt tggcaccatg ttaatacctc 3660 
ccttcacatt gattggctct ctattcattt tttctgagat ttctcctgat tccatggatt 3720 
acttaggagc ttcagaatct gaaattgtat acctggcact gctaatacct taccttcatt 3780 
ttctcatttt tcttttcatt ctgcgatgcc tagaaatgaa ctgcaggaag aaactaatga 3840 
gaaaggatcc tgtgttcaga atttctccaa gaagcaacgc tatttttcca aacccagaag 3900 
agcctgaagg agaggaggaa gatatccaga tggaaagaat gagaacagtg aatgctatgg 3960 
ctgtgcgaga ctttgatgag acacccgtca tcattgccag ctgtctacgg aaggaatatg 4020 
caggcaaaaa gaaaaattgc ttttctaaaa ggaagaaaac aattgccaca agaaatgtct 4080 
ctttttgtgt taaaaaaggt gaagttatag gactgttagg acacaatgga gctggtaaaa 4140 
gtacaactat taagatgata actggagaca caaaaccaac tgcaggacag gtgattttga 4200 
aagggagcgg tggaggggaa cccctgggct tcctggggta ctgccctcag gagaatgcgc 4260 
tgtggcccaa cctgacagtg aggcagcacc tggaggtgta cgctgccgtg aaaggtctca 4320 
ggaaagggga cgcaatgatc gccatcacac ggttagtgga tgcgctcaag ctgcaggacc 4380 
agctgaaggc tcccgtgaag accttgtcag agggaataaa gcgaaagctg cgctttgtgc 4440 
tgagcatcct ggggaacccg tcagtggtgc ttctggatga gccgtcgacc gggatggacc 4500 
ccgaggggca gcagcaaatg tggcaggtga ttcgggccac ctttagaaac acggagaggg 4560 
gcgccctcct gaccacccac tacatggcag aggctgaggc ggtgtgtgac cgagtggcca 4620 
tcatggtgtc aggaaggctg agatgtattg gttccatcca acacctgaaa agcaaatttg 4680 
gcaaagacta cctgctggag atgaagctga agaacctggc acaaatggag cccctccatg 4740 
cagagatcct gaggcttttc ccccaggctg ctcagcagga aaggttctcc tccctgatgg 4800 
tctataagtt gcctgttgag gatgtgcgac ctttatcaca ggctttcttc aaattagaga 4860 
tagttaaaca gagtttcgac ctggaggagt acagcctctc acagtctacc ctggagcagg 4920 
ttttcctgga gctctccaag gagcaggagc tgggtgatct tgaagaggac tttgatccct 4980 
cggtgaagtg gaaactcctc ctgcaggaag agccttaaag ctccaaatac cctatatctt 5040 
tctttaatcc tgtgactctt ttaaagataa tattttatag ccttaatatg ccttatatca 5100 
gaggtggtac aaaatgcatt tgaaactcat gcaataatta tcctcagtag tatttcttac 5160 
agtgagacaa caggcaatgt cagtgagggc gatcgtaggg cataagccta agccatacca 5220 
tgcagccttt gtgccagcaa ccaaatccca tgtttcctac tgtgttaagt ttaaaaatgc 5280 
atttattata gaattgtcta catttctgag gatgtcatgg agaatgctta attttctttc 5340 
tctgaacttc aaaatattaa atattttctt atttttttga ttaaagtata aattaagaca 5400 
ccctattgac ttccgggtaa ggggagtcaa ttgattaccc agcagcacag tatttgcttt 5460 
ttataiattcc ctttttaaat acttgttctt aattgactgg ttttcctttt ctgtcatttt 5520 
tcagagttta gattgtgagt ccatgttttg tctgttgtgc ctataaagga aatttgaaat 5580 
gtgtatcatt ctactataaa gacacatgca cacgtatgtt tattgcagca ctgtttacaa 5640
tagcaaagac ttggaaccaa ccaaaatacc cacaaatgat agaccggata aagaaaacgt 5700 
gacacatata caccatggaa tactatgcag ccatagaaaa ggatgagttc atattcttca 5760 
cagggacatg gatgaagctg gaaaccatca tcctcagcaa actaacacag gaacagaaaa 5820 
ccaaacaccg catgttctca ctcataagtg ggaattgaac aatgagaata catggacaca 5880 
gggaggggaa caccacaccc tggggcctgt tggggggatg ggggctaggg gagggatagc 5940 
attaggagaa atacctgatg tagatgatgg gttgatgggt g 5981 



<210> 4 
<211> 6181 
<212> DNA

<213> Homo sapiens 
<400> 4 

aattaatttt acttaggata agtgttgtta ttattgtttt tattgttgtt ctgttagtta 60 
ctcaaaactt cattctaatt gtgccctgag tttgttaaaa taccatactg tatttttgtg 120 
taacatgtaa ataggcatta atttttgaga aatagaaatg tttatcctta atgtattttt 180 
aatttgctaa cattgatttt ttattttctt tcctgaaata gcttatttcc taaaatgaaa 240 
gaatttattc tcagatgaat aatttttata tcagctattc ttatcagagc aataaacaaa 300
taccaatgat gcgctcagcc aacaattcat tacactctct gaagagtaac tggacaagga 360 
gaaaaacata gggaaaaaac caacagaatt tgttggcatg ttctacacac agaccatggc 420 
ttttcagaag ccaagctgaa taaaaacagt tttaaaagag gcaaccattt gtagaggagt 480 
ccttgaagga ttcttcattg ttttcttgga caaaaagaga ccagtggatc caagtgcttc 540 
aaatacttct ctcttatttt cttaactcta ttgctctgca atatttactt taccctgtta 600 
atgaacagga caaaatggtt aaaaaagaga taagcgtgcg tcaacaaatt caggctcttc 660 
tgtacaagaa ttttcttaaa aaatggagaa taaaaagaga gtttattgga atggacaata 720 
acattgtttc tagggctata tttgtgcatc ttttcggaac acttcagagc tacccgtttt 780 
cctgaacaac ctcctaaagt cctgggaagc gtggatcagt ttaatgactc tggcctggta 840 
gtggcatata caccagtcag taacataaca caaaggataa tgaataagat ggccttggct 900 
tcctttatga aaggaagaac agtcattggg acaccagatg aagagaccat ggatatagaa 960 
cttccaaaaa aataccatga aatggtggga gttatattta gtgatacttt ctcatatcgc 1020 
ctgaagttta attggggata tagaatccca gttataaagg agcactctga atacacagaa 1080 
cactgttggg ccatgcatgg tgaaattttt tgttacttgg caaagtactg gctaaaaggg 1140 
tttgtagctt ttcaagctgc aattaatgct gcaattatag aagtcacaac aaatcattct 1200 
gtaatggagg agttgacatc agttattgga ataaatatga agataccacc tttcatttct 1260 
aagggagaaa ttatgaatga atggtttcat tttacttgct tagtttcttt ctcttctttt 1320 
atatactttg catcattaaa tgttgcaagg gaaagaggaa aatttaagaa actgatgaca 1380 
gtaatgggtc tccgagagtc agcattctgg ctctcctggn gattgacata catttgcttc 1440 
atcttcatta tgtccatttt tatggctctg gtcataacat caatctcaat tgtatttcat 1500
actggcttca tggtgatatt cacactctat agcttatatg gcctttcttt gatagcattg 1560 
gctttcctca tgagtgtttt aataaggaaa cctatgctcg ctggtttggc tggatttctc 1620 
ttcactgtat tttggggatg tctgggattc actgtgttat acagacaact tcctttatct 1680 
ttgggatggg tattaagtct tcttagccct tttgccttca ctgctggaat ggcccaggtt 1740 
acacacctgg ataattactt aagtggtgtt atttttcctg atccctctgg ggattcatac 1800 
aciaatgatag ccactttttt cattttggca tttgatactc ttttctattt gatattcaca 1860 
ttatattttg agcgagtttt acctgataaa gatggccatg gggattctcc attatttttc 1920 
cttaagtcct cattttggtc caaacatcaa aatactcatc atgaaatctt tgagaatgaa 1980 
ataaatcctg agcattcctc tgatgattct tttgaaccgg tgtctccaga attccatgga 2040 
aaagaagcca taagaatcag aaatgttata aaagaatata atggaaagac tggaaaagta 2100 
gaagcattgc aaggcatatt ttttgacata tatgaaggac agatcactgc aatacttggg 2160 
cataatggag ctggtaaatc aacactgcta aacattctta gtggattgtc tgtttctaca 2220 
gaaggatcag ccactattta taatactcaa ctctctgaaa taactgacat ggaagaaatt 2280 
agaaagaata ttggattttg tccacagttc aattttcaat ttgacttcct cactgtgaga 2340 
gaaaacctca gggtatttgc taaaataaaa gggattcagc caaaggaagt ggaacaagag 2400 
gtaaaaagaa ttataatgga attagacatg caaagcattc aagacattat tgctaaaaaa 2460 
ttaagtggtg ggcagaagag aaaactaaca ctagggattg ccatcttagg agatcctcag 2520 
gttttgctgc tagatgaacc aactgctgga ttggatccct tttcaagaca ccgagtgtgg 2580 
agcctcctga aggagcataa agtagaccga cttatcctct tcagtaccca attcatggat 2640 
gaggctgaca tcttggctga taggaaagta tttctgtcta atgggaagtt gaaatgtgca 2700 
ggatcatctt tgtttctgaa gcgaaagtgg ggtattggat atcatttaag tttacacagg 2760 
aatgaaatgt gtgacacaga aaaaatcaca tcccttatta agcagcacat tcctgatgcc 2820 
aagttaacaa cagaaagtga agaaaaactt gtatatagtt tgcctttgga aaaaacgaac 2880 
aaatttccag atctttacag tgaccttgat aagtgttctg accagggcat aaggaattat 2940 
gctgtttcag tgacatctct gaatgaagta ttcttgaacc tagaaggaaa atcagcaatt 3000 
gatgaaccag attttgacat tgggaaacaa gagaaaatac atgtgacaag aaatactgga 3060 
gatgagtctg aaatggaaca ggttctttgt tctcttcctg aaacaagaaa ggctgtcagt 3120 
agtgcagctc tctggagacg acaaatctat gcagtggcaa cacttcgctt cttaaagtta 3180
aggcgtgaaa ggagagctct tttgtgtttg ttactagtac ttggaattgc ttttatcccc 3240 
atcattctag agaagataat gtataaagta actcgtgaaa ctcattgttg ggagttttca 3300 
cccagtatgt atttcctttc tctggaacaa atcccgaaga cgcctcttac cagcctgtta 3360 
atcgttaata atacaggatc aaatattgaa gacctcgtgc attcactgaa gtgtcaggat 3420 
atagttttgg aaatagatga ctttagaaac agaaatggct cagatgatcc ctcctacaat 3480 
ggagccatca tagtgtctgg tgaccagaag gattacagat tttctgttgc gtgtaatacc 3540 
aagaaattga attgttttcc tgttcttatg ggaattgtta gcaatgccct tatgggaatt 3600 
tttaacttca cggagcttat tcaaacggag agcacttcat tttctcgtga tgacatagtg 3660 
ctggatcttg gttttataga tgggtccata tttttgttgt tgatcacaaa ctgcgtttct 3720 
ccttttatcg gcatgagcag catcagcgat tataaaaaaa atgttcaatc ccagttatgg 3780
atttcaggcc tctggccttc agcatactgg tgtggacagg ctctggtgga cattccatta 3840 
tacttcttga ttctcttttc aatacattta atttactact tcatatttct gggattccag 3900 
ctttcatggg aactcatgtt tgttttggtg gtatgcataa ttggttgtgc agtttctctt 3960 
atattcctca catatgtgct ttcattcatc tttcgcaagt ggagaaaaaa taatggcttt 4020 
tggtcttttg gcttttttat tatcttaata tgtgtatcca caattatggt atcaactcaa 4080 
tatgaaaaac tcaacttaat tttgtgcatg attttcatac cttccttcac tttgctgggg 4140 
tatgtcatgt tattgatcca gctcgacttt atgagaaact tggacagtct ggacaataga 4200 
ataaatgaag tcaataaaac cattctttta acaaccttaa taccatacct tcagagtgtt 4260 
attttccttt ttgtcataag gtgtctggaa atgaagtatg gaaatgaaat aatgaataaa 4320 
gacccagttt tcagaatctc tccacggagt agagaaactc atcccaatcc ggaagagccc 4380 
gaagaagaag atgaagatgt tcaagctgaa agagtccaag cagcaaatgc actcactgct 4440 
ccaaacttgg aggaggaacc agtcataact gcaagctgtt tacacaagga atattatgag 4500 
acaaagaaaa gttgcttttc aacaagaaag aagaaaatag ccatcagaaa tgtttccttt 4560 
tgtgttaaaa aaggtgaagt tttgggatta ctaggacaca atggagctgg taaaagtact 4620 
tccattaaaa tgataactgg gtgcacaaag ccaactgcag gagtggtggt gttacaaggc 4680 
agcagagcat cagtaaggca acagcatgac aacagcctca agttcttggg gtactgccct 4740 
caggagaact cactgtggcc caagcttaca atgaaagagc acttggagtt gtatgcagct 4800 
gtgaaaggac tgggcaaaga agatgctgct ctcagtattt cacgattggt ggaagctctt 4860 
aagctccagg aacaacttaa ggctcctgtg aaaactctat cagagggaat aaagagaaag 4920 
ctgtgctttg tgctgagcat cctggggaac ccatcagtgg tgcttctaga tgagccgttc 4980 
accgggatgg accccgaggg gcagcagcaa atgtggcaga tacttcaggc taccgttaaa 5040 
aacaaggaga ggggcaccct cttgaccacc cattacatgt cagaggctga ggctgtgtgt 5100 
gaccgtatgg ccatgatggt gtcaggaacg ctaaggtgta ttggttccat tcaacatctg 5160 
aaaaacaagt ttggtagaga ttatttacta gaaataaaaa tgaaagaacc tacccaggtg 5220 
gaagctctcc acacagagat tttgaagctt ttcccacagg ctgcttggca ggaaagatat 5280 
tcctctttaa tggcgtataa gttacctgtg gaggatgtcc accctctatc tcgggccttt 5340 
ttcaagttag aggcgatgaa acagaccttc aacctggagg aatacagcct ctctcaggct 5400 
accttggagc aggtattctt agaactctgt aaagagcagg agctgggaaa tgttgatgat 5460 
aaaattgata caacagttga atggaaactt ctcccacagg aagaccctta aaatgaagaa 5520 
cctcctaaca ttcaatttta ggtcctacta cattgttagt ttccataatt ctacaagaiat 5580 
gtttcctttt acttcagtta acaaaagaaa acatttaata aacattcaat aatgattaca 5640 
gttttcattt ttaaaaattt aggatgaagg aaacaaggaa atatagggaa aagtagtaga 5700 
caaaattaac aaaatcagac atgttattca tccccaacat gggtctattt tgtgcttaaa 5760 
aataatttaa aaatcataca atattaggtt ggttttcggt tattatcaat aaagctaaca 5820 
ctgagaacat tttacaaata aaaatatgag ttttttagcc tgaacttcaa atgtatcagc 5880 
tatttttaaa cattatttac tcggattcta atttaatgtg acattgacta taagaaggtc 5940 
tgataaactg atgaaatggc acagcataac atttaattat aatgacattc tgattataaa 6000 
ataaatgcat gtgaatttta gtacatattg aagttatatg gaagaagata gccataatct 6060 
gtaagaaagt accgcagtta atattttctt tagccaactt atattcaatg tattttttat 6120 
ggatcctttt tcaaaggtag tatcagtagg catagtcatt ttctgtatct tttcacctca 6180 
c 6181 



<210> 5
<211> 1642 
<212> PRT 

<213> Homo sapiens
<400> 5 

Met Ser Thr Ala Ile Arg Glu Val Gly Val Trp Arg Gln Thr Arg Thr
1 5 10 15

Leu Leu Leu Lys Asn Tyr Leu Ile Lys Cys Arg Thr Lys Lys Ser Ser
20 25 30

Val Gln Glu Ile Leu Phe Pro Leu Phe Phe Leu Phe Trp Leu Ile Leu
35 40 45

Ile Ser Met Met His Pro Asn Lys Lys Tyr Glu Glu Val Pro Asn Ile
50 55 60

Glu Leu Asn Pro Met Asp Lys Phe Thr Leu Ser Asn Leu Ile Leu Gly
65 70 75 80

Tyr Thr Pro Val Thr Asn Ile Thr Ser Ser Ile Met Gln Lys Val Ser
85 90 95

Thr Asp His Leu Pro Asp Val Ile Ile Thr Glu Glu Tyr Thr Asn Glu
100 105 110 

Lys Glu Met Leu Thr Ser Ser Leu Ser Lys Pro Ser Asn Phe Val Gly 
115 120 125 

Val Val Phe Lys Asp Ser Met Ser Tyr Glu Leu Arg Phe Phe Pro Asp 
130 135 140 

Met Ile Pro Val Ser Ser Ile Tyr Met Asp Ser Arg Ala Gly Cys Ser
145 150 155 160

Lys Ser Cys Glu Ala Ala Gln Tyr Trp Ser Ser Gly Phe Thr Val Leu
165 170 175

Gln Ala Ser Ile Asp Ala Ala Ile Ile Gln Leu Lys Thr Asn Val Ser
180 185 190

Leu Trp Lys Glu Leu Glu Ser Thr Lys Ala Val Ile Met Gly Glu Thr
195 200 205

Ala Val Val Glu Ile Asp Thr Phe Pro Arg Gly Val Ile Leu Ile Tyr
210 215 220

Leu Val Ile Ala Phe Ser Pro Phe Gly Tyr Phe Leu Ala Ile His Ile
225 230 235 240

Val Ala Glu Lys Glu Lys Lys Ile Lys Glu Phe Leu Lys Ile Met Gly
245 250 255 

Leu His Asp Thr Ala Phe Trp Leu Ser Trp Val Leu Leu Tyr Thr Ser 
260 265 270 

Leu Ile Phe Leu Met Ser Leu Leu Met Ala Val Ile Ala Thr Ala Ser
275 280 285

Leu Leu Phe Pro Gln Ser Ser Ser Ile Val Ile Phe Leu Leu Phe Phe
290 295 300 

Leu Tyr Gly Leu Ser Ser Val Phe Phe Ala Leu Met Leu Thr Pro Leu 
305 310 315 320 

Phe Lys Lys Ser Lys His Val Gly Ile Val Glu Phe Phe Val Thr Val
325 330 335

Ala Phe Gly Phe Ile Gly Leu Met Ile Ile Leu Ile Glu Ser Phe Pro
340 345 350

Lys Ser Leu Val Trp Leu Phe Ser Pro Phe Cys His Cys Thr Phe Val
355 360 365

Ile Gly Ile Ala Gln Val Met His Leu Glu Asp Phe Asn Glu Gly Ala
370 375 380

Ser Phe Ser Asn Leu Thr Ala Gly Pro Tyr Pro Leu Ile Ile Thr Ile
385 390 395 400

Ile Met Leu Thr Leu Asn Ser Ile Phe Tyr Val Leu Leu Ala Val Tyr
405 410 415

Leu Asp Gln Val Ile Pro Gly Glu Phe Gly Leu Arg Arg Ser Ser Leu
420 425 430 

Tyr Phe Leu Lys Pro Ser Tyr Trp Ser Lys Ser Lys Arg Asn Tyr Glu
435 440 445

Glu Leu Ser Glu Gly Asn Val Asn Gly Asn Ile Ser Phe Ser Glu Ile
450 455 460

Ile Glu Pro Val Ser Ser Glu Phe Val Gly Lys Glu Ala Ile Arg Ile
465 470 475 480

Ser Gly Ile Gln Lys Thr Tyr Arg Lys Lys Gly Glu Asn Val Glu Ala
485 490 495

Leu Arg Asn Leu Ser Phe Asp Ile Tyr Glu Gly Gln Ile Thr Ala Leu
500 505 510

Leu Gly His Ser Gly Thr Gly Lys Ser Thr Leu Met Asn Ile Leu Cys
515 520 525

Gly Leu Cys Pro Pro Ser Asp Gly Phe Ala Ser Ile Tyr Gly His Arg
530 535 540

Val Ser Glu Ile Asp Glu Met Phe Glu Ala Arg Lys Met Ile Gly Ile
545 550 555 560

Cys Pro Gln Leu Asp Ile His Phe Asp Val Leu Thr Val Glu Glu Asn
565 570 575

Leu Ser Ile Leu Ala Ser Ile Lys Gly Ile Pro Ala Asn Asn Ile Ile
580 585 590

Gln Glu Val Gln Lys Val Leu Leu Asp Leu Asp Met Gln Thr Ile Lys
595 600 605

Asp Asn Gln Ala Lys Lys Leu Ser Gly Gly Gln Lys Arg Lys Leu Ser
610 615 620

Leu Gly Ile Ala Val Leu Gly Asn Pro Lys Ile Leu Leu Leu Asp Glu
625 630 635 640

Pro Thr Ala Gly Met Asp Pro Cys Ser Arg His Ile Val Trp Asn Leu
645 650 655 



Leu Lys Tyr Arg Lys Ala Asn Arg Val Thr Val Phe Ser Thr His Phe 
660 665 670



Met Asp Glu Ala Asp lie Leu Ala Asp Arg Lys Ala Val lie Ser Gin 
675 680 685 

Gly Met Leu Lys Cys Val Gly Ser Ser Met Phe Leu Lys Ser Lys Trp 
690 695 700 

Gly Ile Gly Tyr Arg Leu Ser Met Tyr Ile Asp Lys Tyr Cys Ala Thr
705 710 715 720

Glu Ser Leu Ser Ser Leu Val Lys Gln His Ile Pro Gly Ala Thr Leu
725 730 735

Leu Gln Gln Asn Asp Gln Gln Leu Val Tyr Ser Leu Pro Phe Lys Asp
740 745 750

Met Asp Lys Phe Ser Gly Leu Phe Ser Ala Leu Asp Ser His Ser Asn
755 760 765

Leu Gly Val Ile Ser Tyr Gly Val Ser Met Thr Thr Leu Glu Asp Val
770 775 780

Phe Leu Lys Leu Glu Val Glu Ala Glu Ile Asp Gln Ala Asp Tyr Ser
785 790 795 800

Val Phe Thr Gln Gln Pro Leu Glu Glu Glu Met Asp Ser Lys Ser Phe
805 810 815

Asp Glu Met Glu Gln Ser Leu Leu Ile Leu Ser Glu Thr Lys Ala Ser
820 825 830

Leu Val Ser Thr Met Ser Leu Trp Lys Gln Gln Met Tyr Thr Ile Ala
835 840 845

Lys Phe His Phe Phe Thr Leu Lys Arg Glu Ser Lys Ser Val Arg Ser
850 855 860

Val Leu Leu Leu Leu Leu Ile Phe Phe Thr Val Gln Ile Phe Met Phe
865 870 875 880



Leu Val His His Ser Phe Lys Asn Ala Val Val Pro Ile Lys Leu Val
885 890 895

Pro Asp Leu Tyr Phe Leu Lys Pro Gly Asp Lys Pro His Lys Tyr Lys
900 905 910

Thr Ser Leu Leu Leu Gln Asn Ser Ala Asp Ser Asp Ile Ser Asp Leu
915 920 925

Ile Ser Phe Phe Thr Ser Gln Asn Ile Met Val Thr Met Ile Asn Asp
930 935 940

Ser Asp Tyr Val Ser Val Ala Pro His Ser Ala Ala Leu Asn Val Met
945 950 955 960

His Ser Glu Lys Asp Tyr Val Phe Ala Ala Val Phe Asn Ser Thr Met
965 970 975

Val Tyr Ser Leu Pro Ile Leu Val Asn Ile Ile Ser Asn Tyr Tyr Leu
980 985 990

Tyr His Leu Asn Val Thr Glu Thr Ile Gln Ile Trp Ser Thr Pro Phe
995 1000 1005

Phe Gln Glu Ile Thr Asp Ile Val Phe Lys Ile Glu Leu Tyr Phe Gln
1010 1015 1020

Ala Ala Leu Leu Gly Ile Ile Val Thr Ala Met Pro Pro Tyr Phe Ala
1025 1030 1035 1040

Met Glu Asn Ala Glu Asn His Lys Ile Lys Ala Tyr Thr Gln Leu Lys
1045 1050 1055

Leu Ser Gly Leu Leu Pro Ser Ala Tyr Trp Ile Gly Gln Ala Val Val
1060 1065 1070

Asp Ile Pro Leu Phe Phe Ile Ile Leu Ile Leu Met Leu Gly Ser Leu
1075 1080 1085

Leu Ala Phe His Tyr Gly Leu Tyr Phe Tyr Thr Val Lys Phe Leu Ala
1090 1095 1100

Val Val Phe Cys Leu Ile Gly Tyr Val Pro Ser Val Ile Leu Phe Thr
1105 1110 1115 1120

Tyr Ile Ala Ser Phe Thr Phe Lys Lys Ile Leu Asn Thr Lys Glu Phe
1125 1130 1135

Trp Ser Phe Ile Tyr Ser Val Ala Ala Leu Xaa Cys Ile Ala Ile Thr
1140 1145 1150

Glu Ile Thr Phe Phe Met Gly Tyr Thr Ile Ala Thr Ile Leu His Tyr
1155 1160 1165

Ala Phe Cys Ile Ile Ile Pro Ile Tyr Pro Leu Leu Gly Cys Leu Ile
1170 1175 1180

Ser Phe Ile Lys Ile Ser Trp Lys Asn Val Arg Lys Asn Val Asp Thr
1185 1190 1195 1200

Tyr Asn Pro Trp Asp Arg Leu Ser Val Ala Val Ile Ser Pro Tyr Leu
1205 1210 1215

Gln Cys Val Leu Trp Ile Phe Leu Leu Gln Tyr Tyr Glu Lys Lys Tyr
1220 1225 1230

Gly Gly Arg Ser Ile Arg Lys Asp Pro Phe Phe Arg Asn Leu Ser Thr
1235 1240 1245

Lys Ser Lys Asn Arg Lys Leu Pro Glu Pro Pro Asp Asn Glu Asp Glu
1250 1255 1260

Asp Glu Asp Val Lys Ala Glu Arg Leu Lys Val Lys Glu Leu Met Gly
1265 1270 1275 1280

Cys Gln Cys Cys Glu Glu Lys Pro Ser Ile Met Val Ser Asn Leu His
1285 1290 1295

Lys Glu Tyr Asp Asp Lys Lys Asp Phe Leu Leu Ser Arg Lys Val Lys
1300 1305 1310

Lys Val Ala Thr Lys Tyr Ile Ser Phe Cys Val Lys Lys Gly Glu Ile
1315 1320 1325

Leu Gly Leu Leu Gly Pro Asn Gly Ala Gly Lys Ser Thr Ile Ile Asn
1330 1335 1340

Ile Leu Val Gly Asp Ile Glu Pro Thr Ser Gly Gln Val Phe Leu Gly
1345 1350 1355 1360

Asp Tyr Ser Ser Glu Thr Ser Glu Asp Asp Asp Ser Leu Lys Cys Met
1365 1370 1375

Gly Tyr Cys Pro Gln Ile Asn Pro Leu Trp Pro Asp Thr Thr Leu Gln
1380 1385 1390

Glu His Phe Glu Ile Tyr Gly Ala Val Lys Gly Met Ser Ala Ser Asp
1395 1400 1405

Met Lys Glu Val Ile Ser Arg Ile Thr His Ala Leu Asp Leu Lys Glu
1410 1415 1420

His Leu Gln Lys Thr Val Lys Lys Leu Pro Ala Gly Ile Lys Arg Lys
1425 1430 1435 1440

Leu Cys Phe Ala Leu Ser Met Leu Gly Asn Pro Gln Ile Thr Leu Leu
1445 1450 1455

Asp Glu Pro Ser Thr Gly Met Asp Pro Lys Ala Lys Gln His Met Trp
1460 1465 1470

Arg Ala Ile Arg Thr Ala Phe Lys Asn Arg Lys Arg Ala Ala Ile Leu
1475 1480 1485

Thr Thr His Tyr Met Glu Glu Ala Glu Ala Val Cys Asp Arg Val Ala 
1490 1495 1500 

Ile Met Val Ser Gly Gln Leu Arg Cys Ile Gly Thr Val Gln His Leu
1505 1510 1515 1520

Lys Ser Lys Phe Gly Lys Gly Tyr Phe Leu Glu Ile Lys Leu Lys Asp
1525 1530 1535

Trp Ile Glu Asn Leu Glu Val Asp Arg Leu Gln Arg Glu Ile Gln Tyr
1540 1545 1550

Ile Phe Pro Asn Ala Ser Arg Gln Glu Ser Phe Ser Ser Ile Leu Ala
1555 1560 1565

Tyr Lys Ile Pro Lys Glu Asp Val Gln Ser Leu Ser Gln Ser Phe Phe
1570 1575 1580

Lys Leu Glu Glu Ala Lys His Ala Phe Ala Ile Glu Glu Tyr Ser Phe
1585 1590 1595 1600

Ser Gln Ala Thr Leu Glu Gln Val Phe Val Glu Leu Thr Lys Glu Gln
1605 1610 1615

Glu Glu Glu Asp Asn Ser Cys Gly Thr Leu Asn Ser Thr Leu Trp Trp
1620 1625 1630

Glu Arg Thr Gln Glu Asp Arg Val Val Phe
1635 1640



<210> 6 
<211> 1617 
<212> PRT 

<213> Homo sapiens 

<400> 6

Met Asn Met Lys Gln Lys Ser Val Tyr Gln Gln Thr Lys Ala Leu Leu
1 5 10 15

Cys Lys Asn Phe Leu Lys Lys Trp Arg Met Lys Arg Glu Ser Leu Leu
20 25 30

Glu Trp Gly Leu Ser Ile Leu Leu Gly Leu Cys Ile Ala Leu Phe Ser
35 40 45

Ser Ser Met Arg Asn Val Gln Phe Pro Gly Met Ala Pro Gln Asn Leu
50 55 60

Gly Arg Val Asp Lys Phe Asn Ser Ser Ser Leu Met Val Val Tyr Thr
65 70 75 80

Pro Ile Ser Asn Leu Thr Gln Gln Ile Met Asn Lys Thr Ala Leu Ala
85 90 95

Pro Leu Leu Lys Gly Thr Ser Val Ile Gly Ala Pro Asn Lys Thr His
100 105 110

Met Asp Glu Ile Leu Leu Glu Asn Leu Pro Tyr Ala Met Gly Ile Ile
115 120 125

Phe Asn Glu Thr Phe Ser Tyr Lys Leu Ile Phe Phe Gln Gly Tyr Asn
130 135 140

Ser Pro Leu Trp Lys Glu Asp Phe Ser Ala His Cys Trp Asp Gly Tyr
145 150 155 160

Gly Glu Phe Ser Cys Thr Leu Thr Lys Tyr Trp Asn Arg Gly Phe Val
165 170 175

Ala Leu Gln Thr Ala Ile Asn Thr Ala Ile Ile Glu Ile Thr Thr Asn
180 185 190

His Pro Val Met Glu Glu Leu Met Ser Val Thr Ala Ile Thr Met Lys
195 200 205

Thr Leu Pro Phe Ile Thr Lys Asn Leu Leu His Asn Glu Met Phe Ile
210 215 220

Leu Phe Phe Leu Leu His Phe Ser Pro Leu Val Tyr Phe Ile Ser Leu
225 230 235 240

Asn Val Thr Lys Glu Arg Lys Lys Ser Lys Asn Leu Met Lys Met Met
245 250 255

Gly Leu Gln Asp Ser Ala Phe Trp Leu Ser Trp Gly Leu Ile Tyr Ala
260 265 270

Gly Phe Ile Phe Ile Ile Ser Ile Phe Ile Thr Ile Ile Ile Thr Phe
275 280 285

Thr Gln Ile Ile Val Met Thr Gly Phe Met Val Ile Phe Ile Leu Phe
290 295 300

Phe Leu Tyr Gly Leu Ser Leu Val Ala Leu Val Phe Leu Met Ser Val
305 310 315 320

Leu Leu Lys Lys Ala Val Leu Thr Asn Leu Val Val Phe Leu Leu Thr
325 330 335

Leu Phe Trp Gly Cys Leu Gly Phe Thr Val Phe Tyr Glu Gln Leu Pro
340 345 350

Ser Ser Leu Glu Trp Ile Leu Asn Ile Cys Ser Pro Phe Ala Phe Thr
355 360 365

Thr Gly Met Ile Gln Ile Ile Lys Leu Asp Tyr Asn Leu Asn Gly Val
370 375 380

Ile Phe Pro Asp Pro Ser Gly Asp Ser Tyr Thr Met Ile Ala Thr Phe
385 390 395 400

Ser Met Leu Leu Leu Asp Gly Leu Ile Tyr Leu Leu Leu Ala Leu Tyr
405 410 415

Phe Asp Lys Ile Leu Pro Tyr Gly Asp Glu Arg His Tyr Ser Pro Leu
420 425 430

Phe Phe Leu Asn Ser Ser Ser Cys Phe Gln His Gln Arg Thr Asn Ala
435 440 445

Lys Val Ile Glu Lys Glu Ile Asp Ala Glu His Pro Ser Asp Asp Tyr
450 455 460

Phe Glu Pro Val Ala Pro Glu Phe Gln Gly Lys Glu Ala Ile Arg Ile
465 470 475 480

Arg Asn Val Lys Lys Glu Tyr Lys Gly Lys Ser Gly Lys Val Glu Ala
485 490 495

Leu Lys Gly Leu Leu Phe Asp Ile Tyr Glu Gly Gln Ile Thr Ala Ile
500 505 510

Leu Gly His Ser Gly Ala Gly Lys Ser Ser Leu Leu Asn Ile Leu Asn
515 520 525

Gly Leu Ser Val Pro Thr Glu Gly Ser Val Thr Ile Tyr Asn Lys Asn
530 535 540

Leu Ser Glu Met Gln Asp Leu Glu Glu Ile Arg Lys Ile Thr Gly Val
545 550 555 560

Cys Pro Gln Phe Asn Val Gln Phe Asp Ile Leu Thr Val Lys Glu Asn
565 570 575

Leu Ser Leu Phe Ala Lys Ile Lys Gly Ile His Leu Lys Glu Val Glu
580 585 590

Gln Glu Val Gln Arg Ile Leu Leu Glu Leu Asp Met Gln Asn Ile Gln
595 600 605

Asp Asn Leu Ala Lys His Leu Ser Glu Gly Gln Lys Arg Lys Leu Thr
610 615 620

Phe Gly Ile Thr Ile Leu Gly Asp Pro Gln Ile Leu Leu Leu Asp Glu
625 630 635 640

Pro Thr Thr Gly Leu Asp Pro Phe Ser Arg Asp Gln Val Trp Ser Leu
645 650 655

Leu Arg Glu Arg Arg Ala Asp His Val Ile Leu Phe Ser Thr Gln Ser
660 665 670

Met Asp Glu Ala Asp Ile Leu Ala Asp Arg Lys Val Ile Met Ser Asn
675 680 685

Gly Arg Leu Lys Cys Ala Gly Ser Ser Met Phe Leu Lys Arg Arg Trp
690 695 700

Gly Leu Gly Tyr His Leu Ser Leu His Arg Asn Glu Ile Cys Asn Pro
705 710 715 720

Glu Gln Ile Thr Ser Phe Ile Thr His His Ile Pro Asp Ala Lys Leu
725 730 735

Lys Thr Glu Asn Lys Glu Lys Leu Val Tyr Thr Leu Pro Leu Glu Arg
740 745 750

Thr Asn Thr Phe Pro Asp Leu Phe Ser Asp Leu Asp Lys Cys Ser Asp
755 760 765

Gln Gly Val Thr Gly Tyr Asp Ile Ser Met Ser Thr Leu Asn Glu Val
770 775 780

Phe Met Lys Leu Glu Gly Gln Ser Thr Ile Glu Gln Asp Phe Glu Gln
785 790 795 800

Val Glu Met Ile Arg Asp Ser Glu Ser Leu Asn Glu Met Glu Leu Ala
805 810 815



His Ser Ser Phe Ser Glu Met Gln Thr Ala Val Ser Asp Met Gly Leu
820 825 830

Trp Arg Met Gln Val Phe Ala Met Ala Arg Leu Arg Phe Leu Lys Leu
835 840 845

Lys Arg Gln Thr Lys Val Leu Leu Thr Leu Leu Leu Val Phe Gly Ile
850 855 860

Ala Ile Phe Pro Leu Ile Val Glu Asn Ile Ile Tyr Ala Met Leu Asn
865 870 875 880

Glu Lys Ile Asp Trp Glu Phe Lys Asn Glu Leu Tyr Phe Leu Ser Pro
885 890 895

Gly Gln Leu Pro Gln Glu Pro Arg Thr Ser Leu Leu Ile Ile Asn Asn
900 905 910

Thr Glu Ser Asn Ile Glu Asp Phe Ile Lys Ser Leu Lys His Gln Asn
915 920 925

Ile Leu Leu Glu Val Asp Asp Phe Glu Asn Arg Asn Gly Thr Asp Gly
930 935 940

Leu Ser Tyr Asn Gly Ala Ile Ile Val Ser Gly Lys Gln Lys Asp Tyr
945 950 955 960

Arg Phe Ser Val Val Cys Asn Thr Lys Arg Leu His Cys Phe Pro Ile
965 970 975

Leu Met Asn Ile Ile Ser Asn Gly Leu Leu Gln Met Phe Asn His Thr
980 985 990

Gln His Ile Arg Ile Glu Ser Ser Pro Phe Pro Leu Ser His Ile Gly
995 1000 1005

Leu Trp Thr Gly Leu Pro Asp Gly Ser Phe Phe Leu Phe Leu Val Leu
1010 1015 1020

Cys Ser Ile Ser Pro Tyr Ile Thr Met Gly Ser Ile Ser Asp Tyr Lys
1025 1030 1035 1040

Lys Asn Ala Lys Ser Gln Leu Trp Ile Ser Gly Leu Tyr Thr Ser Ala
1045 1050 1055

Tyr Trp Cys Gly Gln Ala Leu Val Asp Val Ser Phe Phe Ile Leu Ile
1060 1065 1070

Leu Leu Leu Met Tyr Leu Ile Phe Tyr Ile Glu Asn Met Gln Tyr Leu
1075 1080 1085

Leu Ile Thr Ser Gln Ile Val Phe Ala Leu Val Ile Val Thr Pro Gly
1090 1095 1100

Tyr Ala Ala Ser Leu Val Phe Phe Ile Tyr Met Ile Ser Phe Ile Phe
1105 1110 1115 1120

Arg Lys Arg Arg Lys Asn Ser Gly Leu Trp Ser Phe Tyr Phe Phe Phe
1125 1130 1135

Ala Ser Thr Ile Met Phe Ser Ile Thr Leu Ile Asn His Phe Asp Leu
1140 1145 1150

Ser Ile Leu Ile Thr Thr Met Val Leu Val Pro Ser Tyr Thr Leu Leu
1155 1160 1165

Gly Phe Lys Thr Phe Leu Glu Val Arg Asp Gln Glu His Tyr Arg Glu
1170 1175 1180

Phe Pro Glu Ala Asn Phe Glu Leu Ser Ala Thr Asp Phe Leu Val Cys
1185 1190 1195 1200

Phe Ile Pro Tyr Phe Gln Thr Leu Leu Phe Val Phe Val Leu Arg Cys
1205 1210 1215

Met Glu Leu Lys Cys Gly Lys Lys Arg Met Arg Lys Asp Pro Val Phe
1220 1225 1230

Arg Ile Ser Pro Gln Ser Arg Asp Ala Lys Pro Asn Pro Glu Glu Pro
1235 1240 1245

Ile Asp Glu Asp Glu Asp Ile Gln Thr Glu Arg Ile Arg Thr Ala Thr
1250 1255 1260

Ala Leu Thr Thr Ser Ile Leu Asp Glu Lys Pro Val Ile Ile Ala Ser
1265 1270 1275 1280

Cys Leu His Lys Glu Tyr Ala Gly Gln Lys Lys Ser Cys Phe Ser Lys
1285 1290 1295

Arg Lys Lys Lys Ile Ala Ala Arg Asn Ile Ser Phe Cys Val Gln Glu
1300 1305 1310

Gly Glu Ile Leu Gly Leu Leu Gly Pro Ser Gly Ala Gly Lys Ser Ser
1315 1320 1325

Ser Ile Arg Met Ile Ser Gly Ile Thr Lys Pro Thr Ala Gly Glu Val
1330 1335 1340

Glu Leu Lys Gly Cys Ser Ser Val Leu Gly His Leu Gly Tyr Cys Pro
1345 1350 1355 1360

Gln Glu Asn Val Leu Trp Pro Met Leu Thr Leu Arg Glu His Leu Glu
1365 1370 1375

Val Tyr Ala Ala Val Lys Gly Leu Arg Lys Ala Asp Ala Arg Leu Ala
1380 1385 1390

Ile Ala Arg Leu Val Ser Ala Phe Lys Leu His Glu Gln Leu Asn Val
1395 1400 1405

Pro Val Gln Lys Leu Thr Ala Gly Ile Thr Arg Lys Leu Cys Phe Val
1410 1415 1420

Leu Ser Leu Leu Gly Asn Ser Pro Val Leu Leu Leu Asp Glu Pro Ser
1425 1430 1435 1440

Thr Gly Ile Asp Pro Thr Gly Gln Gln Gln Met Trp Gln Ala Ile Gln
1445 1450 1455

Ala Val Val Lys Asn Thr Glu Arg Gly Val Leu Leu Thr Thr His Asn 
1460 1465 1470 

Leu Ala Glu Ala Glu Ala Leu Cys Asp Arg Val Ala Ile Met Val Ser
1475 1480 1485

Gly Arg Leu Arg Cys Ile Gly Ser Ile Gln His Leu Lys Asn Lys Leu
1490 1495 1500

Gly Lys Asp Tyr Ile Leu Glu Leu Lys Val Lys Glu Thr Ser Gln Val
1505 1510 1515 1520

Thr Leu Val His Thr Glu Ile Leu Lys Leu Phe Pro Gln Ala Ala Gly
1525 1530 1535

Gln Glu Arg Tyr Ser Ser Leu Leu Thr Tyr Lys Leu Pro Val Ala Asp
1540 1545 1550

Val Tyr Pro Leu Ser Gln Thr Phe His Lys Leu Glu Ala Val Lys His
1555 1560 1565

Asn Phe Asn Leu Glu Glu Tyr Ser Leu Ser Gln Cys Thr Leu Glu Lys
1570 1575 1580

Val Phe Leu Glu Leu Ser Lys Glu Gln Glu Val Gly Asn Phe Asp Glu
1585 1590 1595 1600

Glu Ile Asp Thr Thr Met Arg Trp Lys Leu Leu Pro His Ser Asp Glu
1605 1610 1615 

Pro 



<210> 7 
<211> 1624 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Met Ser Lys Arg Arg Met Ser Val Gly Gln Gln Thr Trp Ala Leu Leu
1 5 10 15

Cys Lys Asn Cys Leu Lys Lys Trp Arg Met Lys Arg Gln Thr Leu Leu
20 25 30

Glu Trp Leu Phe Ser Phe Leu Leu Val Leu Phe Leu Tyr Leu Phe Phe
35 40 45

Ser Asn Leu His Gln Val His Asp Thr Pro Gln Met Ser Ser Met Asp
50 55 60

Leu Gly Arg Val Asp Ser Phe Asn Asp Thr Asn Tyr Val Ile Ala Phe
65 70 75 80

Ala Pro Glu Ser Lys Thr Thr Gln Glu Ile Met Asn Lys Val Ala Ser
85 90 95

Ala Pro Phe Leu Lys Gly Arg Thr Ile Met Gly Trp Pro Asp Glu Lys
100 105 110

Ser Met Asp Glu Leu Asp Leu Asn Tyr Ser Ile Asp Ala Val Arg Val
115 120 125

Ile Phe Thr Asp Thr Phe Ser Tyr His Leu Lys Phe Ser Trp Gly His
130 135 140

Arg Ile Pro Met Met Lys Glu His Arg Asp His Ser Ala His Cys Gln
145 150 155 160

Ala Val Asn Glu Lys Met Lys Cys Glu Gly Ser Glu Phe Trp Glu Lys
165 170 175

Gly Phe Val Ala Phe Gln Ala Ala Ile Asn Ala Ala Ile Ile Glu Ile
180 185 190

Ala Thr Asn His Ser Val Met Glu Gln Leu Met Ser Val Thr Gly Val
195 200 205

His Met Lys Ile Leu Pro Phe Val Ala Gln Gly Gly Val Ala Thr Asp
210 215 220

Phe Phe Ile Phe Phe Cys Ile Ile Ser Phe Ser Thr Phe Ile Tyr Tyr
225 230 235 240

Val Ser Val Asn Val Thr Gln Glu Arg Gln Tyr Ile Thr Ser Leu Met
245 250 255

Thr Met Met Gly Leu Arg Glu Ser Ala Phe Trp Leu Ser Trp Gly Leu
260 265 270

Met Tyr Ala Gly Phe Ile Leu Ile Met Ala Thr Leu Met Ala Leu Ile
275 280 285

Val Lys Ser Ala Gln Ile Val Val Leu Thr Gly Phe Val Met Val Phe
290 295 300

Thr Leu Phe Leu Leu Tyr Gly Leu Ser Leu Ile Thr Leu Ala Phe Leu
305 310 315 320

Met Ser Val Leu Ile Lys Lys Pro Phe Leu Thr Gly Leu Val Val Phe
325 330 335

Leu Leu Ile Val Phe Trp Gly Ile Leu Gly Phe Pro Ala Leu Tyr Thr
340 345 350

His Leu Pro Ala Phe Leu Glu Trp Thr Leu Cys Leu Leu Ser Pro Phe
355 360 365

Ala Phe Thr Val Gly Met Ala Gln Leu Ile His Leu Asp Tyr Asp Val
370 375 380

Asn Ser Asn Ala His Leu Asp Ser Ser Gln Asn Pro Tyr Leu Ile Ile
385 390 395 400



Ala Thr Leu Phe Met Leu Val Phe Asp Thr Leu Leu Tyr Leu Val Leu 
405 410 415 

Thr Leu Tyr Phe Asp Lys Ile Leu Pro Ala Glu Tyr Gly His Arg Cys
420 425 430

Ser Pro Leu Phe Phe Leu Lys Ser Cys Phe Trp Phe Gln His Gly Arg
435 440 445

Ala Asn His Val Val Leu Glu Asn Glu Thr Asp Ser Asp Pro Thr Pro
450 455 460

Asn Asp Cys Phe Glu Pro Val Ser Pro Glu Phe Cys Gly Lys Glu Ala
465 470 475 480

Ile Arg Ile Lys Asn Leu Lys Lys Glu Tyr Ala Gly Lys Cys Glu Arg
485 490 495

Val Glu Ala Leu Lys Gly Val Val Phe Asp Ile Tyr Glu Gly Gln Ile
500 505 510

Thr Ala Leu Leu Gly His Ser Gly Ala Gly Lys Thr Thr Leu Leu Asn
515 520 525

Ile Leu Ser Gly Leu Ser Val Pro Thr Ser Gly Ser Val Thr Val Tyr
530 535 540

Asn His Thr Leu Ser Arg Met Ala Asp Ile Glu Asn Ile Ser Lys Phe
545 550 555 560

Thr Gly Phe Cys Pro Gln Ser Asn Val Gln Phe Gly Phe Leu Thr Val
565 570 575

Lys Glu Asn Leu Arg Leu Phe Ala Lys Ile Lys Gly Ile Leu Pro His
580 585 590

Glu Val Glu Lys Glu Val Gln Arg Val Val Gln Glu Leu Glu Met Glu
595 600 605

Asn Ile Gln Asp Ile Leu Ala Gln Asn Leu Ser Gly Gly Gln Asn Arg
610 615 620

Lys Leu Thr Phe Gly Ile Ala Ile Leu Gly Asp Pro Gln Val Leu Leu
625 630 635 640

Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro Leu Ser Arg His Arg Ile
645 650 655

Trp Asn Leu Leu Lys Glu Gly Lys Ser Asp Arg Val Ile Leu Phe Ser
660 665 670

Thr Gln Phe Ile Asp Glu Ala Asp Ile Leu Ala Asp Arg Lys Val Phe
675 680 685

Ile Ser Asn Gly Lys Leu Lys Cys Ala Gly Ser Ser Leu Phe Leu Lys
690 695 700

Lys Lys Trp Gly Ile Gly Tyr His Leu Ser Leu His Leu Asn Glu Arg
705 710 715 720

Cys Asp Pro Glu Ser Ile Thr Ser Leu Val Lys Gln His Ile Ser Asp
725 730 735

Ala Lys Leu Thr Ala Gln Ser Glu Glu Lys Leu Val Tyr Ile Leu Pro
740 745 750

Leu Glu Arg Thr Asn Lys Phe Pro Glu Leu Tyr Arg Asp Leu Asp Arg
755 760 765

Cys Ser Asn Gln Gly Ile Glu Asp Tyr Gly Val Ser Ile Thr Thr Leu
770 775 780

Asn Glu Val Phe Leu Lys Leu Glu Gly Lys Ser Thr Ile Asp Glu Ser
785 790 795 800

Asp Ile Gly Ile Trp Gly Gln Leu Gln Thr Asp Gly Ala Lys Asp Ile
805 810 815

Gly Ser Leu Val Glu Leu Glu Gln Val Leu Ser Ser Phe His Glu Thr
820 825 830

Arg Lys Thr Ile Ser Gly Val Ala Leu Trp Arg Gln Gln Val Cys Ala
835 840 845

Ile Ala Lys Val Arg Phe Leu Lys Leu Lys Lys Glu Arg Lys Ser Leu
850 855 860

Trp Thr Ile Leu Leu Leu Phe Gly Ile Ser Phe Ile Pro Gln Leu Leu
865 870 875 880

Glu His Leu Phe Tyr Glu Ser Tyr Gln Lys Ser Tyr Pro Trp Glu Leu
885 890 895

Ser Pro Asn Thr Tyr Phe Leu Ser Pro Gly Gln Gln Pro Gln Asp Pro
900 905 910

Leu Thr His Leu Leu Val Ile Asn Lys Thr Gly Ser Thr Ile Asp Asn
915 920 925

Phe Leu His Ser Leu Arg Arg Gln Asn Ile Ala Ile Glu Val Asp Ala
930 935 940

Phe Gly Thr Arg Asn Gly Thr Asp Asp Pro Ser Tyr Asn Gly Ala Ile
945 950 955 960

Ile Val Ser Gly Asp Glu Lys Asp His Arg Phe Ser Ile Ala Cys Asn
965 970 975

Thr Lys Arg Leu Asn Cys Phe Pro Val Leu Leu Asp Val Ile Ser Asn
980 985 990

Gly Leu Leu Gly Ile Phe Asn Ser Ser Glu His Ile Gln Thr Asp Arg
995 1000 1005

Ser Thr Phe Phe Glu Glu His Met Asp Tyr Glu Tyr Gly Tyr Arg Ser
1010 1015 1020

Asn Thr Phe Phe Trp Ile Pro Met Ala Ala Ser Phe Thr Pro Tyr Ile
1025 1030 1035 1040

Ala Met Ser Ser Ile Gly Asp Tyr Lys Lys Lys Ala His Ser Gln Leu
1045 1050 1055

Arg Ile Ser Gly Leu Tyr Pro Ser Ala Tyr Trp Phe Gly Gln Ala Leu
1060 1065 1070

Val Asp Val Ser Leu Tyr Phe Leu Ile Leu Leu Leu Met Gln Ile Met
1075 1080 1085

Asp Tyr Ile Phe Ser Pro Glu Glu Ile Ile Phe Ile Ile Gln Asn Leu
1090 1095 1100

Leu Ile Gln Ile Leu Cys Ser Ile Gly Tyr Val Ser Ser Leu Val Phe
1105 1110 1115 1120

Leu Thr Tyr Val Ile Ser Phe Ile Phe Arg Asn Gly Arg Lys Asn Ser
1125 1130 1135

Gly Ile Trp Ser Phe Phe Phe Leu Ile Val Val Ile Phe Ser Ile Val
1140 1145 1150



Ala Thr Asp Leu Asn Glu Tyr Gly Phe Leu Gly Leu Phe Phe Gly Thr
1155 1160 1165

Met Leu Ile Pro Pro Phe Thr Leu Ile Gly Ser Leu Phe Ile Phe Ser
1170 1175 1180

Glu Ile Ser Pro Asp Ser Met Asp Tyr Leu Gly Ala Ser Glu Ser Glu
1185 1190 1195 1200



Ile Val Tyr Leu Ala Leu Leu Ile Pro Tyr Leu His Phe Leu Ile Phe
1205 1210 1215

Leu Phe Ile Leu Arg Cys Leu Glu Met Asn Cys Arg Lys Lys Leu Met
1220 1225 1230

Arg Lys Asp Pro Val Phe Arg Ile Ser Pro Arg Ser Asn Ala Ile Phe
1235 1240 1245

Pro Asn Pro Glu Glu Pro Glu Gly Glu Glu Glu Asp Ile Gln Met Glu
1250 1255 1260

Arg Met Arg Thr Val Asn Ala Met Ala Val Arg Asp Phe Asp Glu Thr
1265 1270 1275 1280

Pro Val Ile Ile Ala Ser Cys Leu Arg Lys Glu Tyr Ala Gly Lys Lys
1285 1290 1295

Lys Asn Cys Phe Ser Lys Arg Lys Lys Thr Ile Ala Thr Arg Asn Val
1300 1305 1310

Ser Phe Cys Val Lys Lys Gly Glu Val Ile Gly Leu Leu Gly His Asn
1315 1320 1325

Gly Ala Gly Lys Ser Thr Thr Ile Lys Met Ile Thr Gly Asp Thr Lys
1330 1335 1340

Pro Thr Ala Gly Gln Val Ile Leu Lys Gly Ser Gly Gly Gly Glu Pro
1345 1350 1355 1360

Leu Gly Phe Leu Gly Tyr Cys Pro Gln Glu Asn Ala Leu Trp Pro Asn
1365 1370 1375

Leu Thr Val Arg Gln His Leu Glu Val Tyr Ala Ala Val Lys Gly Leu
1380 1385 1390

Arg Lys Gly Asp Ala Met Ile Ala Ile Thr Arg Leu Val Asp Ala Leu
1395 1400 1405

Lys Leu Gln Asp Gln Leu Lys Ala Pro Val Lys Thr Leu Ser Glu Gly
1410 1415 1420

Ile Lys Arg Lys Leu Arg Phe Val Leu Ser Ile Leu Gly Asn Pro Ser
1425 1430 1435 1440

Val Val Leu Leu Asp Glu Pro Ser Thr Gly Met Asp Pro Glu Gly Gln
1445 1450 1455

Gln Gln Met Trp Gln Val Ile Arg Ala Thr Phe Arg Asn Thr Glu Arg
1460 1465 1470

Gly Ala Leu Leu Thr Thr His Tyr Met Ala Glu Ala Glu Ala Val Cys
1475 1480 1485

Asp Arg Val Ala Ile Met Val Ser Gly Arg Leu Arg Cys Ile Gly Ser
1490 1495 1500

Ile Gln His Leu Lys Ser Lys Phe Gly Lys Asp Tyr Leu Leu Glu Met
1505 1510 1515 1520

Lys Leu Lys Asn Leu Ala Gln Met Glu Pro Leu His Ala Glu Ile Leu
1525 1530 1535

Arg Leu Phe Pro Gln Ala Ala Gln Gln Glu Arg Phe Ser Ser Leu Met
1540 1545 1550

Val Tyr Lys Leu Pro Val Glu Asp Val Arg Pro Leu Ser Gln Ala Phe
1555 1560 1565

Phe Lys Leu Glu Ile Val Lys Gln Ser Phe Asp Leu Glu Glu Tyr Ser
1570 1575 1580

Leu Ser Gln Ser Thr Leu Glu Gln Val Phe Leu Glu Leu Ser Lys Glu
1585 1590 1595 1600

Gln Glu Leu Gly Asp Leu Glu Glu Asp Phe Asp Pro Ser Val Lys Trp
1605 1610 1615

Lys Leu Leu Leu Gln Glu Glu Pro
1620



<210> 8
<211> 1543
<212> PRT

<213> Homo sapiens
<400> 8

Met Asn Lys Met Ala Leu Ala Ser Phe Met Lys Gly Arg Thr Val Ile
1 5 10 15

Gly Thr Pro Asp Glu Glu Thr Met Asp Ile Glu Leu Pro Lys Lys Tyr
20 25 30

His Glu Met Val Gly Val Ile Phe Ser Asp Thr Phe Ser Tyr Arg Leu
35 40 45

Lys Phe Asn Trp Gly Tyr Arg Ile Pro Val Ile Lys Glu His Ser Glu
50 55 60

Tyr Thr Glu His Cys Trp Ala Met His Gly Glu Ile Phe Cys Tyr Leu
65 70 75 80

Ala Lys Tyr Trp Leu Lys Gly Phe Val Ala Phe Gln Ala Ala Ile Asn
85 90 95

Ala Ala Ile Ile Glu Val Thr Thr Asn His Ser Val Met Glu Glu Leu
100 105 110

Thr Ser Val Ile Gly Ile Asn Met Lys Ile Pro Pro Phe Ile Ser Lys
115 120 125

Gly Glu Ile Met Asn Glu Trp Phe His Phe Thr Cys Leu Val Ser Phe
130 135 140

Ser Ser Phe Ile Tyr Phe Ala Ser Leu Asn Val Ala Arg Glu Arg Gly
145 150 155 160

Lys Phe Lys Lys Leu Met Thr Val Met Gly Leu Arg Glu Ser Ala Phe
165 170 175

Trp Leu Ser Trp Xaa Leu Thr Tyr Ile Cys Phe Ile Phe Ile Met Ser
180 185 190

Ile Phe Met Ala Leu Val Ile Thr Ser Ile Ser Ile Val Phe His Thr
195 200 205

Gly Phe Met Val Ile Phe Thr Leu Tyr Ser Leu Tyr Gly Leu Ser Leu
210 215 220

Ile Ala Leu Ala Phe Leu Met Ser Val Leu Ile Arg Lys Pro Met Leu
225 230 235 240

Ala Gly Leu Ala Gly Phe Leu Phe Thr Val Phe Trp Gly Cys Leu Gly
245 250 255

Phe Thr Val Leu Tyr Arg Gln Leu Pro Leu Ser Leu Gly Trp Val Leu
260 265 270

Ser Leu Leu Ser Pro Phe Ala Phe Thr Ala Gly Met Ala Gln Val Thr
275 280 285

His Leu Asp Asn Tyr Leu Ser Gly Val Ile Phe Pro Asp Pro Ser Gly
290 295 300

Asp Ser Tyr Lys Met Ile Ala Thr Phe Phe Ile Leu Ala Phe Asp Thr
305 310 315 320

Leu Phe Tyr Leu Ile Phe Thr Leu Tyr Phe Glu Arg Val Leu Pro Asp
325 330 335

Lys Asp Gly His Gly Asp Ser Pro Leu Phe Phe Leu Lys Ser Ser Phe
340 345 350

Trp Ser Lys His Gln Asn Thr His His Glu Ile Phe Glu Asn Glu Ile
355 360 365

Asn Pro Glu His Ser Ser Asp Asp Ser Phe Glu Pro Val Ser Pro Glu
370 375 380

Phe His Gly Lys Glu Ala Ile Arg Ile Arg Asn Val Ile Lys Glu Tyr
385 390 395 400

Asn Gly Lys Thr Gly Lys Val Glu Ala Leu Gln Gly Ile Phe Phe Asp
405 410 415

Ile Tyr Glu Gly Gln Ile Thr Ala Ile Leu Gly His Asn Gly Ala Gly
420 425 430

Lys Ser Thr Leu Leu Asn Ile Leu Ser Gly Leu Ser Val Ser Thr Glu
435 440 445

Gly Ser Ala Thr Ile Tyr Asn Thr Gln Leu Ser Glu Ile Thr Asp Met
450 455 460

Glu Glu Ile Arg Lys Asn Ile Gly Phe Cys Pro Gln Phe Asn Phe Gln
465 470 475 480

Phe Asp Phe Leu Thr Val Arg Glu Asn Leu Arg Val Phe Ala Lys Ile
485 490 495

Lys Gly Ile Gln Pro Lys Glu Val Glu Gln Glu Val Lys Arg Ile Ile
500 505 510

Met Glu Leu Asp Met Gln Ser Ile Gln Asp Ile Ile Ala Lys Lys Leu
515 520 525

Ser Gly Gly Gln Lys Arg Lys Leu Thr Leu Gly Ile Ala Ile Leu Gly
530 535 540

Asp Pro Gln Val Leu Leu Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro
545 550 555 560

Phe Ser Arg His Arg Val Trp Ser Leu Leu Lys Glu His Lys Val Asp
565 570 575

Arg Leu Ile Leu Phe Ser Thr Gln Phe Met Asp Glu Ala Asp Ile Leu
580 585 590

Ala Asp Arg Lys Val Phe Leu Ser Asn Gly Lys Leu Lys Cys Ala Gly 
595 600 605 

Ser Ser Leu Phe Leu Lys Arg Lys Trp Gly Ile Gly Tyr His Leu Ser
610 615 620

Leu His Arg Asn Glu Met Cys Asp Thr Glu Lys Ile Thr Ser Leu Ile
625 630 635 640

Lys Gln His Ile Pro Asp Ala Lys Leu Thr Thr Glu Ser Glu Glu Lys
645 650 655

Leu Val Tyr Ser Leu Pro Leu Glu Lys Thr Asn Lys Phe Pro Asp Leu
660 665 670

Tyr Ser Asp Leu Asp Lys Cys Ser Asp Gln Gly Ile Arg Asn Tyr Ala
675 680 685

Val Ser Val Thr Ser Leu Asn Glu Val Phe Leu Asn Leu Glu Gly Lys
690 695 700

Ser Ala Ile Asp Glu Pro Asp Phe Asp Ile Gly Lys Gln Glu Lys Ile
705 710 715 720

His Val Thr Arg Asn Thr Gly Asp Glu Ser Glu Met Glu Gln Val Leu
725 730 735

Cys Ser Leu Pro Glu Thr Arg Lys Ala Val Ser Ser Ala Ala Leu Trp 
740 745 750 

Arg Arg Gln Ile Tyr Ala Val Ala Thr Leu Arg Phe Leu Lys Leu Arg
755 760 765

Arg Glu Arg Arg Ala Leu Leu Cys Leu Leu Leu Val Leu Gly Ile Ala
770 775 780

Phe Ile Pro Ile Ile Leu Glu Lys Ile Met Tyr Lys Val Thr Arg Glu
785 790 795 800

Thr His Cys Trp Glu Phe Ser Pro Ser Met Tyr Phe Leu Ser Leu Glu
805 810 815

Gln Ile Pro Lys Thr Pro Leu Thr Ser Leu Leu Ile Val Asn Asn Thr
820 825 830

Gly Ser Asn Ile Glu Asp Leu Val His Ser Leu Lys Cys Gln Asp Ile
835 840 845

Val Leu Glu Ile Asp Asp Phe Arg Asn Arg Asn Gly Ser Asp Asp Pro
850 855 860

Ser Tyr Asn Gly Ala Ile Ile Val Ser Gly Asp Gln Lys Asp Tyr Arg
865 870 875 880

Phe Ser Val Ala Cys Asn Thr Lys Lys Leu Asn Cys Phe Pro Val Leu
885 890 895

Met Gly Ile Val Ser Asn Ala Leu Met Gly Ile Phe Asn Phe Thr Glu
900 905 910

Leu Ile Gln Thr Glu Ser Thr Ser Phe Ser Arg Asp Asp Ile Val Leu
915 920 925

Asp Leu Gly Phe Ile Asp Gly Ser Ile Phe Leu Leu Leu Ile Thr Asn
930 935 940

Cys Val Ser Pro Phe Ile Gly Met Ser Ser Ile Ser Asp Tyr Lys Lys
945 950 955 960

Asn Val Gln Ser Gln Leu Trp Ile Ser Gly Leu Trp Pro Ser Ala Tyr
965 970 975

Trp Cys Gly Gln Ala Leu Val Asp Ile Pro Leu Tyr Phe Leu Ile Leu
980 985 990

Phe Ser Ile His Leu Ile Tyr Tyr Phe Ile Phe Leu Gly Phe Gln Leu
995 1000 1005

Ser Trp Glu Leu Met Phe Val Leu Val Val Cys Ile Ile Gly Cys Ala
1010 1015 1020

Val Ser Leu Ile Phe Leu Thr Tyr Val Leu Ser Phe Ile Phe Arg Lys
1025 1030 1035 1040

Trp Arg Lys Asn Asn Gly Phe Trp Ser Phe Gly Phe Phe Ile Ile Leu
1045 1050 1055

Ile Cys Val Ser Thr Ile Met Val Ser Thr Gln Tyr Glu Lys Leu Asn
1060 1065 1070

Leu Ile Leu Cys Met Ile Phe Ile Pro Ser Phe Thr Leu Leu Gly Tyr
1075 1080 1085

Val Met Leu Leu Ile Gln Leu Asp Phe Met Arg Asn Leu Asp Ser Leu
1090 1095 1100

Asp Asn Arg Ile Asn Glu Val Asn Lys Thr Ile Leu Leu Thr Thr Leu
1105 1110 1115 1120

Ile Pro Tyr Leu Gln Ser Val Ile Phe Leu Phe Val Ile Arg Cys Leu
1125 1130 1135

Glu Met Lys Tyr Gly Asn Glu Ile Met Asn Lys Asp Pro Val Phe Arg
1140 1145 1150

Ile Ser Pro Arg Ser Arg Glu Thr His Pro Asn Pro Glu Glu Pro Glu
1155 1160 1165

Glu Glu Asp Glu Asp Val Gln Ala Glu Arg Val Gln Ala Ala Asn Ala
1170 1175 1180



Leu Thr Ala Pro Asn Leu Glu Glu Glu Pro Val Ile Thr Ala Ser Cys
1185 1190 1195 1200



Leu His Lys Glu Tyr Tyr Glu Thr Lys Lys Ser Cys Phe Ser Thr Arg 
1205 1210 1215 

Lys Lys Lys Ile Ala Ile Arg Asn Val Ser Phe Cys Val Lys Lys Gly
1220 1225 1230

Glu Val Leu Gly Leu Leu Gly His Asn Gly Ala Gly Lys Ser Thr Ser
1235 1240 1245

Ile Lys Met Ile Thr Gly Cys Thr Lys Pro Thr Ala Gly Val Val Val
1250 1255 1260

Leu Gln Gly Ser Arg Ala Ser Val Arg Gln Gln His Asp Asn Ser Leu
1265 1270 1275 1280

Lys Phe Leu Gly Tyr Cys Pro Gln Glu Asn Ser Leu Trp Pro Lys Leu
1285 1290 1295

Thr Met Lys Glu His Leu Glu Leu Tyr Ala Ala Val Lys Gly Leu Gly
1300 1305 1310

Lys Glu Asp Ala Ala Leu Ser Ile Ser Arg Leu Val Glu Ala Leu Lys
1315 1320 1325

Leu Gln Glu Gln Leu Lys Ala Pro Val Lys Thr Leu Ser Glu Gly Ile
1330 1335 1340

Lys Arg Lys Leu Cys Phe Val Leu Ser Ile Leu Gly Asn Pro Ser Val
1345 1350 1355 1360

Val Leu Leu Asp Glu Pro Phe Thr Gly Met Asp Pro Glu Gly Gln Gln
1365 1370 1375

Gln Met Trp Gln Ile Leu Gln Ala Thr Val Lys Asn Lys Glu Arg Gly
1380 1385 1390

Thr Leu Leu Thr Thr His Tyr Met Ser Glu Ala Glu Ala Val Cys Asp
1395 1400 1405

Arg Met Ala Met Met Val Ser Gly Thr Leu Arg Cys Ile Gly Ser Ile
1410 1415 1420

Gln His Leu Lys Asn Lys Phe Gly Arg Asp Tyr Leu Leu Glu Ile Lys
1425 1430 1435 1440



Met Lys Glu Pro Thr Gln Val Glu Ala Leu His Thr Glu Ile Leu Lys
1445 1450 1455

Leu Phe Pro Gln Ala Ala Trp Gln Glu Arg Tyr Ser Ser Leu Met Ala
1460 1465 1470

Tyr Lys Leu Pro Val Glu Asp Val His Pro Leu Ser Arg Ala Phe Phe
1475 1480 1485

Lys Leu Glu Ala Met Lys Gln Thr Phe Asn Leu Glu Glu Tyr Ser Leu
1490 1495 1500

Ser Gln Ala Thr Leu Glu Gln Val Phe Leu Glu Leu Cys Lys Glu Gln
1505 1510 1515 1520

Glu Leu Gly Asn Val Asp Asp Lys Ile Asp Thr Thr Val Glu Trp Lys
1525 1530 1535

Leu Leu Pro Gln Glu Asp Pro
1540



<210> 9 
<211> 130 
<212> DNA
<213> Homo sapiens 

<400> 9 

ctgctggagt aggcacGcat ttaaagaaaa aatgaagaag cagcaataaa gaagttgtaa 60 
tcgttaccta gacaaacaga gaactggttt tgacagtgtt tctagagtgc tttttattat 120 
tttcctgaca 130 



<210> 10 

<211> 141 

<212> DNA

<213> Homo sapiens 

<400> 10 

gttgtgttcc accatgatta ctttctcctt cagcgaatag gctaaatgaa tatgaaacag 60
aaaagcgtgt atcagcaaac caaagcactt ctgtgcaaga attttcttaa gaaatggagg 120
atgaaaagag agagcttatt g 141



<210> 11 
<211> 205
<212> DNA

<213> Homo sapiens

<400> 11 

gaatggggcc tctcaatact tctaggactg tgtattgctc tgttttccag ttccatgaga 60 
aatgtccagt ttcctggaat ggctcctcag aatctgggaa gggtagataa atttaatagc 120 
tcttctttaa tggttgtgta tacaccaata tctaatttaa cccagcagat aatgaataaa 180 
acagcacttg ctcctctttt gaaag 205 



<210> 12 
<211> 159 
<212> DNA

<213> Homo sapiens

<400> 12

gaacaagtgt cattggggca ccaaataaaa cacacatgga cgaaatactt ctggaaaatt 60 
taccatatgc tatgggaatc atctttaatg aaactttctc ttataagtta atatttttcc 120 
agggatataa cagtccactt tggaaagaag atttctcag 159 



<210> 13
<211> 104
<212> DNA

<213> Homo sapiens 
<400> 13 

ctcattgctg ggatggatat ggtgagtttt catgtacatt gaccaaatac tggaatagag 60 
gatttgtggc tttacaaaca gctattaata ctgccattat agaa 104 



<210> 14 
<211> 227 
<212> ADN 

■ <213> Homo sapiens 
<406> 14 

atcacciacca atcaccckgt gatggaggag ttgatgtcag ttactgctat aactatgaag 60
acattacctt tcataactaa aaatcttctt cacaatgaga tgtttatttt attcttcttg 120
cttcatttct ccccacttgt atattttata tcactcaatg taacaaaaga gagaaaaaag 180
tctaagaatt tgatgaaaat gatgggtctc caagattcag cattctg 227



<210> 15 
<211> 142
<212> DNA

<213> Homo sapiens 
<400> 15 

gctctcctgg ggtctaatct atgctggctt catctttatt atttccatat tcattacaat 60 
tatcataaca ttcacccaaa ttatagtcat gactggcttc atggtcatat ttatactctt 120 
ttttttatat ggcttatctt tg 142 

<210> 16 

<211> 186 

<212> ADN 

<213> Homo sapiens 

<400> 16

gtagctttgg tgttcctgat gagtgtgctg ttaaagaaag ctgtcctcac caatttggtt 60 
gtgtttctcc ttaccctctt ttggggatgt ctgggattca ctgtatttta tgaacaactt 120 
ccttcatctc tggagtggat tttgaatatt tgtagccctt ttgcctttac tactggaatg 180 
attcag 186 



<210> 17 
<211> 148 

<212> ADN 

<213> Homo sapiens 
<400> 17 

attatcaaac tggattataa cttgaatggt gtaatttttc ctgacccttc aggagactca 60 
tatacaatga tagcaacttt ttctatgttg cttttggatg gtctcatcta cttgctattg 120 
gcattatact ttgacaaaat tttaccct 148 



<210> 18 
<211> 169
<212> ADN 
<213> Homo sapiens
<400> 18

atggagatga gcgccattat tctcctttat ttttcttgaa ttcatcatct tgtttccaac 60 
accaaaggac taatgctaag gttattgaga aagaaatcga tgctgagcat ccctctgatg 120 
attattttga accagtagct cctgaattcc aaggaaaaga agccatcag 169 



<210> 19 
<211> 59 
<212> ADN 

<213> Homo sapiens 
<400> 19 

aatcagaaat gttaagaagg aatataaagg aaaatctgga aaagtggaag cattgaaag 59



<210> 20 

<211> 111

<212> ADN 

<213> Homo sapiens 
<400> 20

gcttgctctt tgacatatat gaaggtcaaa tcacggcaat cctgggtcac agtggagctg 60 
gcaaatcttc actgctaaat attcttaatg gattgtctgt tccaacagaa g 111 



<210> 21 
<211> 176 
<212> ADN 

<213> Homo sapiens 

<400> 21 

gatcagttac catctataat aaaaatctct ctgaaatgca agacttggag gaaatcagaa 60 
agataactgg cgtctgtcct caattcaatg ttcaatttga catactcacc gtgaaggaaa 120 
acctcagcct gtttgctaaa ataaaaggga ttcatctaaa ggaagtggaa caagag 176 



<210> 22
<211> 120 
<212> ADN 

<213> Homo sapiens 
<400> 22 

gtacaacgaa tattattgga attggacatg caaaacattc aagataacct tgctaaacat 60 
ttaagtgaag gacagaaaag aaagctgact tttgggatta ccattttagg agatcctcaa 120 



<210> 23 
<211> 139 
<212> ADN 

<213> Homo sapiens 
<400> 23 

attttgcttt tagatgaacc aactactgga ttggatccct tttccagaga tcaagtgtgg 60 
agcctcctga gagagcgtag agcagatcat gtgatccttt tcagtaccca gtccatggat 120 
gaggctgaca tcctggctg 139 
<210> 24

<211> 91 

<212> ADN 

<213> Homo sapiens 

<400> 24 

atagaaaagt gatcatgtcc aatgggagac tgaagtgtgc aggttcttct atgtttttga 60 
aaagaaggtg gggtcttgga tatcacctaa g 91 



<210> 25
<211> 140 

<212> ADN 

<213> Homo sapiens 

<400> 25

tttacatagg aatgaaatat gtaacccaga acaaataaca tccttcatta ctcatcacat 60 
ccccgatgct aaattaaaaa cagaaaacaa agaaaagctt gtatatactt tgccactgga 120 
aaggacaaat acatttccag 140 



<210> 26 

<211> 117 

<212> ADN 

<213> Homo sapiens 

<400> 26 

atcttttcag tgatctggat aagtgttctg accagggagt gacaggttat gacatttcca 60 
tgtcaactct aaatgaagtc tttatgaaac tggaaggaca gtcaactatc gaacaag 117 



<210> 27 

<211> 184 

<212> ADN 

<213> Homo sapiens 

<400> 27 

atttcgaaca agtggagatg ataagagact cagaaagcct caatgaaatg gagctggctc 60 
actcttcctt ctctgaaatg cagacagctg tgagtgacat gggcctctgg agaatgcaag 120 
tctttgccat ggcacggctc cgtttcttaa agttaaaacg tcaaactaaa gtgttattga 180 
ccct 184 



<210> 28 
<211> 167 

<212> ADN 

<213> Homo sapiens 

<400> 28 

attattggta tttggaatcg caatattccc tttgattgtt gaaaatataa tatatgctat 60 

gttaaatgaa aagatcgatt gggaatttaa aaacgaattg tattttctct ctcctggaca 120 
acttccccag gaaccccgta ccagcctgtt gatcatcaat aacacag 167 



<210> 29 
<211> 134 
<212> ADN 
<213> Homo sapiens
<400> 29

aatcaaatat tgaagatttt ataaaatcac tgaagcatca aaatatactt ttggaagtag 60 
atgactttga aaacagaaat ggtactgatg gcctctcata caatggagct atcatagttt 120 
ctggtaaaca aaag 134 



<210> 30 

<211> 138 

<212> ADN 

<213> Homo sapiens 

<400> 30 

gattatagat tttcagttgt gtgtaatacc aagagattgc actgttttcc aattcttatg 60 
aatattatca gcaatgggct acttcaaatg tttaatcaca cacaacatat tcgaattgag 120 
tcaagcccat ttcctctt 138 



<210> 31

<211> 108 

<212> ADN 

<213> Homo sapiens 

<400> 31 

agccacatag gactctggac tgggttgccg gatggttcct ttttcttatt tttggttcta 60 
tgtagcattt ctccttatat caccatgggc agcatcagtg attacaag 108 



<210> 32 

<211> 174 

<212> ADN

<213> Homo sapiens 

<400> 32

aaaaatgcta agtcccagct atggatttca ggcctctaca cttctgctta ctggtgtggg 60 
caggcactag tggacgtcag cttcttcatt ttaattctcc ttttaatgta tttaattttc 120 
tacatagaaa acatgcagta ccttcttatt acaagccaaa ttgtgtttgc tttg 174 



<210> 33 

<211> 114 
<212> ADN 

<213> Homo sapiens 
<400> 33 

gttatagtta ctcctggtta tgcagcttct cttgtcttct tcatatatat gatatcattt 60 
atttttcgca aaaggagaaa aaacagtggc ctttggtcat tttacttctt tttt 114 



<210> 34 

<211> 120 

<212> ADN 

<213> Homo sapiens 

<400> 34 

gcctccacca tcatgttttc catcacttta atcaatcatt ttgacctaag tatattgatt 60 
accaccatgg tattggttcc ttcatatacc ttgcttggat ttaaaacttt tttggaagtg 120 
<210> 35
<211> 78 
<212> ADN 

<213> Homo sapiens 
<400> 35 

agagaccagg agcactacag agaatttcca gaggcaaatt ttgaattgag tgccactgat 60 
tttctagtct gcttcata 78 

<210> 36 

<211> 92 

<212> ADN 

<213> Homo sapiens

<400> 36

ccctactttc agactttgct attcgttttt gttctaagat gcatggaact aaaatgtgga 60 
aagaaaagaa tgcgaaaaga tcctgttttc ag 92 



<210> 37 
<211> 121 
<212> ADN 

<213> Homo sapiens 
<400> 37 

aatttccccc caaagtagag atgctaagcc aaatccagaa gaacccatag atgaagatga 60 
agatattcaa acagaaagaa taagaacagc cactgctctg accacttcaa tcttagatga 120 
g 121 



<210> 38 

<211> 118

<212> ADN 

<213> Homo sapiens 
<400> 38

aaacctgtta taattgccag ctgtctacac aaagaatatg caggccagaa gaaaagttgc 60
ttttcaaaga ggaagaagaa aatagcagca agaaatatct ctttctgtgt tcaagaag 118 

<210> 39 
<211> 92
<212> ADN 

<213> Homo sapiens
<400> 39 

gtgaaatttt gggattgcta ggacccagtg gtgctggaaa aagttcatct attagaatga 60 
tatctgggat cacaaagcca actgctggag ag 92 



<210> 40

<211> 155 

<212> ADN

<213> Homo sapiens 
<400> 40

gtggaactga aaggctgcag ttcagttttg ggccacctgg ggtactgccc tcaagagaac 60 
gtgctgtggc ccatgctgac gttgagggaa cacctggagg tgtatgctgc cgtcaagggg 120 
ctcaggaaag cggacgcgag gctcgccatc gcaag 155 



<210> 41 
<211> 76 

<212> ADN 

<213> Homo sapiens 
<400> 41 

attagtgagt gctttcaaac tgcatgagca gctgaatgtt cctgtgcaga aattaacagc 60 
aggaatcacg agaaag 76 



<210> 42 

<211> 95 

<212> ADN 

<213> Homo sapiens 

<400> 42

ttgtgttttg tgctgagcct cctgggaaac tcacctgtct tgctcctgga tgaaccatct 60 
acgggcatag accccacagg gcagcagcaa atgtg 95



<210> 43 

<211> 120 

<212> ADN 

<213> Homo sapiens 

<400> 43

gcaggcaatc caggcagtcg ttaaaaacac agagagaggt gtcctcctga ccacccataa 60 
cctggctgag gcggaagcct tgtgtgaccg tgtggccatc atggtgtctg gaaggcttag 120 



<210> 44 

<211> 141 

<212> ADN

<213> Homo sapiens 

<400> 44 

atgcattggc tccatccaac acctgaaaaa caaacttggc aaggattaca ttctagagct 60 
aaaagtgaag gaaacgtctc aagtgacttt ggtccacact gagattctga agcttttccc 120 
acaggctgca gggcaggaaa g 141 



<210> 45 

<211> 80 

<212> ADN

<213> Homo sapiens 

<400> 45 

gtattcctct ttgttaacct ataagctgcc cgtggcagac gtttaccctc tatcacagac 60 
ctttcacaaa ttagaagcag 80



<210> 46 
<211> 56

<212> ADN 

<213> Homo sapiens 

<400> 46 

tgaagcataa ctttaacctg gaagaataca gcctttctca gtgcacactg gagaag 56 



<210> 47 
<211> 369
<212> ADN 

<213> Homo sapiens 
<400> 47 

gtattcttag agctttctaa agaacaggaa gtaggaaatt ttgatgaaga aattgataca 60 
acaatgagat ggaaactcct ccctcattca gatgaacctt aaaacctcaa acctagtaat 120 
tttttgttga tctcctataa acttatgttt tatgtaataa ttaatagtat gtttaatttt 180 
aaagatcatt taaaattaac atcaggtata ttttgtaaat ttagttaaca aatacataaa 240 
ttttaaaatt attcttcctc tcaaacatag gggtgatagc aaacctgtga taaaggcaat 300 
acaaaatatt agtaaagtca cccaaagagt caggcactgg gtattgtgga aataaaacta 360 
tataaactt 369 



<210> 48 
<211> 130 
<212> ADN 

<213> Homo sapiens

<400> 48 

attcacaatg aatgtgaaat taaaagcatg atgtagtagt gacccaaaag gaatgtgaat 60 
tctcctccag aacatgcaga gacccatgga tgaactgtgt ttctagattt ttcctccagc 120 
tttcctgaga 130 



<210> 49
<211> 109 
<212> ADN 
<213> Homo sapiens 

<400> 49 

gaaacaggtc aaaatgagca agagacgcat gagcgtgggt cagcaaacat gggctcttct 60 
ctgcaagaac tgtctcaaaa aatggagaat gaaaagacag accttgttg 109 



<210> 50
<211> 208 

<212> ADN 

<213> Homo sapiens 

<400> 50 

gaatggctct tttcatttct tctggtactg tttctgtacc tatttttctc caatttacat 60 
caagttcatg acactcctca aatgtcttca atggatctgg gacgtgtaga tagttttaat 120 
gatactaatt atgttattgc atttgcacct gaatccaaaa ctacccaaga gataatgaac 180 
aaagtggctt cagccccatt cctaaaag 208 



<210> 51
<211> 165 




<212> ADN 

<213> Homo sapiens 
<400> 51 

gaagaacaat catggggtgg cctgatgaaa aaagcatgga tgaattggat ttgaactatt 60 
caatagacgc agtgagagtc afcctttactg ataccttctc ctaccatttg aagttttctt 120 
ggggacatag aatccccatg atgaaagagc acagagacca ttcag 165 



<210> 52 
<211> 104 
<212> ADN 

<213> Homo sapiens 
<400> 52 

ctcactgtca agcagtgaat gaaaaaatga agtgtgaagg ttcagagttc tgggagaaag 60 
gctttgtagc ttttcaagct gccattaatg ctgctatcat agaa 104 



<210> 53
<211> 227 
<212> ADN

<213> Homo sapiens 
<400> 53 

atcgcaacaa atcattcagt gatggaacag ctgatgtcag ttactggtgt acatatgaag 60 
atattacctt ttgttgccca aggaggagtt gcaactgatt ttttcatttt cttttgcatt 120 
atttcttttt ctacatttat atactatgta tcagtcaatg ttacacaaga aagacaatac 180 
attacgtcat tgatgacaat gatgggactc cgagagtcag cattctg 227 



<210> 54 
<211> 142 
<212> ADN 

<213> Homo sapiens
<400> 54 

gctttcctgg ggtttgatgt atgctggctt catccttatc atggccactt taatggctct 60 
tattgtaaaa tctgcacaaa ttgtcgtcct gactggtttt gtgatggtct tcaccctctt 120 
tctcctctat ggcctgtctt tg 142 



<210> 55 
<211> 186 
<212> ADN 

<213> Homo sapiens 
<400> 55 

ataactttag c^ttcctgat gagtgtgttg ataaagaaac ctttccttac gggcttggtt 60 
gtgtttctcc ttattgtctt ttgggggatc ctgggattcc cagcattgta tacacatctt 120 
cctgcatttt tggaatggac tttgtgtctt cttagcccct ttgccttcac tgttgggatg 180 
gcccag 186 



<210> 56 
<211> 148 
<212> ADN 

<213> Homo sapiens 
<400> 56

cttatacatt tggactatga tgtgaattct aatgcccact tggattcttc acaaaatcca 60 
tacctcataa tagctactct tttcatgttg gtttttgaca cccttctgta tttggtattg 120 
acattatatt ttgacaaaat tttgcccg 148 



<210> 57 

<211> 169 

<212> ADN 

<213> Homo sapiens 

<400> 57 

ctgaatatgg acatcgatgt tctcccttgt ttttcctgaa atcctgtttt tggtttcaac 60 
acggaagggc taatcatgtg gtccttgaga atgaaacaga ttctgatcct acacctaatg 120 
actgttttga accagtgtct ccagaattct gtgggaagga agccatcag 169 



<210> 58 

<211> 59 

<212> ADN 

<213> Homo sapiens 

<400> 58

aatcaaaaat cttaaaaaag aatatgcagg gaagtgtgag agagtagaag ctttgaaag 59 



<210> 59 

<211> 111 

<212> ADN 

<213> Homo sapiens 

<400> 59 

gtgtggtgtt tgacatatat gaaggccaga tcactgccct ccttggtcac agtggagctg 60 
gaaaaactac cctgttaaac atacttagtg ggttgtcagt tccaacatca g 111 

<210> 60 
<211> 176 
<212> ADN
<213> Homo sapiens 

<400> 60 

gttcagtcac tgtctataat cacacacttt caagaatggc tgatatagaa aatatcagca 60 
agttcactgg attttgtcca caatccaatg tgcaatttgg atttctcact gtgaaagaaa 120 
acctcaggct gtttgctaaa ataaaaggga ttttgccaca tgaagtggag aaagag 176 



<210> 61 

<211> 120 

<212> ADN 

<213> Homo sapiens 

<400> 61 

gtacaacgag ttgtacagga attagaaatg gaaaatattc aagacatcct tgctcaaaac 60 
ttaagtggtg gacaaaatag gaaactaact tttgggattg ccattttagg agatcctcaa 120 
<210> 62
<211> 139 

<212> ADN

<213> Homo sapiens

<400> 62 

gttttgctat tggatgaacc gactgctgga ttggatcctc tttcaaggca ccgaatatgg 60 

aatctcctga aagaggggaa atcagacaga gtaattctct tcagcaccca gtttatagat 120 
gaggctgaca ttctggcgg 139 



<210> 63

<211> 91
<212> ADN 
<213> Homo sapiens 

<400> 63 

acaggaaggt gttcatatcc aatgggaagc tgaagtgtgc aggctcttct ctgttcctta 60 
agaagaaatg gggcataggc taccatttaa g 91 



<210> 64
<211> 140 
<212> ADN 

<213> Homo sapiens 
<400> 64 

tttgcatctg aatgaaaggt gtgatccaga gagtataaca tcactggtta agcagcacat 60 
ctctgatgcc aaattgacag cacaaagtga agaaaaactt gtatatattt tgcctttgga 120 
aaggacaaac aaatttccag 140



<210> 65 

<211> 120

<212> ADN 

<213> Homo sapiens 

<400> 65

aactttacag ggatcttgat agatgttcta accaaggcat tgaggattat ggtgtttcca 60 
taacaacttt gaatgaggtg tttctgaaat tagaaggaaa atcaactatt gatgaatcag 120 



<210> 66 
<211> 199 
<212> ADN 

<213> Homo sapiens
<400> 66

atattggaat ttggggacaa ttacaaactg atggggcaaa agatatagga agccttgttg 60 
agctggaaca agttttgtct tccttccacg aaacaaggaa aacaatcagt ggcgtggcgc 120 
tctggaggca gcaggtctgt gcaatagcaa aagttcgctt cctaaagtta aagaaagaaa 180 
gaaaaagcct gtggactat 199 



<210> 67 

<211> 167 

<212> ADN 

<213> Homo sapiens 
<400> 67

attattgctt tttggtatta gctttatccc tcaacttttg gaacatctat tctacgagtc 60 

atatcagaaa agttacccgt gggaactgtc tccaaataca tacttcctct caccaggaca 120 
acaaccacag gatcctctga cccatttact ggtcatcaat aagacag 167 



<210> 68 
<211> 134 
<212> ADN 

<213> Homo sapiens 
<400> 68

ggtcaaccat tgataacttt ttacattcac tgaggcgaca gaacatagct atagaagtgg 60 
atgcctttgg aactagaaat ggcacagatg acccatctta caatggtgct atcattgtgt 120 
caggtgatga aaag 134 



<210> 69 

<211> 138 

<212> ADN 

<213> Homo sapiens 

<400> 69

gatcacagat tttcaatagc atgtaataca aaacggctga attgctttcc tgtcctcctg 60 
gatgtcatta gcaatggact acttggaatt tttaattcgt cagaacacat tcagactgac 120 
agaagcacat tttttgaa 138 



<210> 70 

<211> 108

<212> ADN 

<213> Homo sapiens 

<400> 70

gagcatatgg attatgagta tgggtaccga agtaacacct tcttctggat accgatggca 60 
gcctctttca ctccatacat tgcaatgagc agcattggtg actacaaa 108 



<210> 71

<211> 174 
<212> ADN 

<213> Homo sapiens 
<400> 71

aaaaaagctc attcccagct acggatttca ggcctctacc cttctgcata ctggtttggc 60 
caagcactgg tggatgtttc cctgtacttt ttgatcctcc tgctaatgca aataatggat 120 
tatattttta gcccagagga gattatattt ataattcaaa acctgttaat tcaa 174 



<210> 72 

<211> 114 

<212> ADN 

<213> Homo sapiens 

<400> 72 

atcctgtgta gtattggcta tgtctcatct cttgttttct tgacatatgt gatttcattc 60 
atttttcgca atgggagaaa aaatagtggc atttggtcat ttttcttctt aatt 114 
<210> 73
<211> 120 
<212> ADN 

<213> Homo sapiens 
<400> 73 

gtggtcatct tctcgatagt tgctactgat ctaaatgaat atggatttct agggctattt 60 
tttggcacca tgttaatacc tcccttcaca ttgattggct ctctattcat tttttctgag 120 



<210> 74

<211> 69 

<212> ADN 

<213> Homo sapiens 

<400> 74

atttctcctg attccatgga ttacttagga gcttcagaat ctgaaattgt atacctggca 60 
ctgctaata 69 



<210> 75 

<211> 92 

<212> ADN 

<213> Homo sapiens 

<400> 75 

ccttaccttc attttctcat ttttcttttc attctgcgat gcctagaaat gaactgcagg 60 
aagaaactaa tgagaaagga tcctgtgttc ag 92 



<210> 76 

<211> 121 

<212> ADN 

<213> Homo sapiens 

<400> 76

aatttctcca agaagcaacg ctatttttcc aaacccagaa gagcctgaag gagaggagga 60 

agatatccag atggaaagaa tgagaacagt gaatgctatg gctgtgcgag actttgatga 120 

g 121 



<210> 77 

<211> 118 

<212> ADN

<213> Homo sapiens 

<400> 77 

acacccgtca tcattgccag ctgtctacgg aaggaatatg caggcaaaaa gaaaaattgc 60 
ttttctaaaa ggaagaaaac aattgccaca agaaatgtct ctttttgtgt taaaaaag 118 



<210> 78 

<211> 92 

<212> ADN 

<213> Homo sapiens 
<400> 78

gtgaagttat aggactgtta ggacacaatg gagctggtaa aagtacaact attaagatga 60 
taactggaga cacaaaacca actgcaggac ag 92 



<210> 79 
<211> 161
<212> ADN
<213> Homo sapiens 

<400> 79

gtgattttga aagggagcgg tggaggggaa cccctgggct tcctggggta ctgccctcag 60 

gagaatgcgc tgtggcccaa cctgacagtg aggcagcacc tggaggtgta cgctgccgtg 120 

aaaggtctca ggaaagggga cgcaatgatc gcbatcacac g 161 



<210> 80 

<211> 76

<212> ADN 

<213> Homo sapiens 

<400> 80 

gttagtggat gcgctcaagc tgcaggacca gctgaaggct cccgtgaaga ccttgtcaga 60 
gggaataaag cgaaag 76 



<210> 81 
<211> 95 
<212> ADN 

<213> Homo sapiens 
<400> 81 

ctgcgctttg tgctgagcat cctggggaac ccgtcagtgg tgcttctgga tgagccgtcg 60 
accgggatgg accccgaggg gcagcagcaa atgtg 95 



<210> 82 

<211> 120 

<212> ADN 

<213> Homo sapiens 

<400> 82 

gcaggtgatt cgggccacct ttagaaacac ggagaggggc gccctcctga ccacccacta 60 
catggcagag gctgaggcgg tgtgtgaccg agtggccatc atggtgtcag gaaggctgag 120 



<210> 83 
<211> 141 
<212> ADN 

<213> Homo sapiens 
<400> 83 

atgtattggt tccatccaac acctgaaaag caaatttggc aaagactacc tgctggagat 60 
gaagctgaag aacctggcac aaatggagcc cctccatgca gagatcctga ggcttttccc 120
ccaggctgct cagcaggaaa g 141 



<210> 84 
<211> 80
<212> ADN 

<213> Homo sapiens 
<400> 84 

gttctcctcc ctgatggtct ataagttgcc tgttgaggat gtgcgacctt tatcacaggc 60 
tttcttcaaa ttagagatag 80 

<210> 85 
<211> 56 
<212> ADN

<213> Homo sapiens 
<400> 85 

ttaaacagag tttcgacctg gaggagtaca gcctctcaca gtctaccctg gagcag 56 

<210> 86 
<211> 1062 
<212> ADN 

<213> Homo sapiens 
<400> 86 

gttttcctgg agctctccaa ggagcaggag ctgggtgatc ttgaagagga ctttgatccc 60 
tcggtgaagt ggaaactcct cctgcaggaa gagccttaaa gctccaaata ccctatatct 120 
ttctttaatc ctgtgactct tttaaagata atattttata gccttaatat gccttatatc 180 
agaggtggta caaaatgcat ttgaaactca tgcaataatt atcctcagta gtatttctta 240 
cagtgagaca acaggcaatg tcagtgaggg cgatcgtagg gcataagcct aagccatacc 300 
atgcagcctt tgtgccagca accaaatccc atgtttccta ctgtgttaag tttaaaaatg 360 
catttattat agaattgtct acatttctga ggatgtcatg gagaatgctt aattttcttt 420 
ctctgaactt caaaatatta aatattttct tatttttttg attaaagtat aaattaagac 480 
accctattga cttccgggta aggggagtca attgattacc cagcagcaca gtatttgctt 540 
tttataattc cctttttaaa tacttgttct taattgactg gttttccttt tctgtcattt 600 
ttcagagttt agattgtgag tccatgtttt gtctgttgtg cctataaagg aaatttgaaa 660 
tctgtatcat tctactataa agacacatgc acacgtatgt ttattgcagc actgtttaca 720 
atagcaaaga cttggaacca accaaaatac ccacaaatga tagaccggat aaagaaaacg 780 
tgacacatat acaccatgga atactatgca gccatagaaa aggatgagtt catattcttc 840 
acagggacat ggatgaagct ggaaaccatc atcctcagca aactaacaca ggaacagaaa 900 
accaaacacc gcatgttctc actcataagt gggaattgaa caatgagaat acatggacac 960 
agggagggga acaccacacc ctggggcctg ttggggggat gggggctagg ggagggatag 1020 
cattaggaga aatacctgat gtagatgatg ggttgatggg tg 1062 



<210> 87 
<211> 287 
<212> ADN 

<213> Homo sapiens 
<400> 87 

aattaatttt acttaggata agtgttgtta ttattgtttt tattgttgtt ctgttagtta 60
ctcaaaactt cattctaatt gtgccctgag tttgttaaaa taccatactg tatttttgtg 120
taacatgtaa ataggcatta atttttgaga aatagaaatg tttatcctta atgtattttt 180
aatttgctaa cattgatttt ttattttctt tcctgaaata gcttatttcc taaaatgaaa 240
gaatttattc tcagatgaat aatttttata tcagctattc ttatcag 287

<210> 88
<211> 280
<212> ADN 

<213> Homo sapiens 
<400> 88 

agcaataaac aaataccaat gatgcgctca gccaacaatt cattacactc tctgaagagt 60 
aactggacaa ggagaaaaac atagggaaaa aaccaacaga atttgttggc atgttctaca 120 
cacagaccat ggcttttcag aagccaagct gaataaaaac agttttaaaa gaggcaacca 180 
tttgtagagg agtccttgaa ggattcttca ttgttttctt ggacaaaaag agaccagtgg 240 
atccaagtgc ttcaaatact tctctcttat tttcttaact 280 



<210> 89
<211> 141 
<212> ADN 
<213> Homo sapiens 

<400> 89

ctattgctct gcaatattta ctttaccctg ttaatgaaca ggacaaaatg gttaaaaaag 60 

agataagcgt gcgtcaacaa attcaggctc ttctgtacaa gaattttctt aaaaaatgga 120
gaataaaaag agagtttatt g 141



<210> 90 

<211> 205 

<212> ADN 

<213> Homo sapiens 

<400> 90 

gaatggacaa taacattgtt tctagggcta tatttgtgca tcttttcgga acacttcaga 60 
gctacccgtt ttcctgaaca acctcctaaa gtcctgggaa gcgtggatca gtttaatgac 120 
tctggcctgg tagtggcata tacaccagtc agtaacataa cacaaaggat aatgaataag 180 
atggccttgg cttcctttat gaaag 205 



<210> 91
<211> 165 
<212> ADN 

<213> Homo sapiens 
<400> 91 

gaagaacagt cattgggaca ccagatgaag agaccatgga tatagaactt ccaaaaaaat 60 
accatgaaat ggtgggagtt atatttagtg atactttctc atatcgcctg aagtttaatt 120 
ggggatatag aatcccagtt ataaaggagc actctgaata cacag 165 



<210> 92 

<211> 104 

<212> ADN 

<213> Homo sapiens 

<400> 92 

aacactgttg ggccatgcat ggtgaaattt tttgttactt ggcaaagtac tggctaaaag 60 
ggtttgtagc ttttcaagct gcaattaatg ctgcaattat agaa 104 



<210> 93 
<211> 227 
<212> ADN

<213> Homo sapiens 

<400> 93 

gtcacaacaa atcattctgt aatggaggag ttgacatcag ttattggaat aaatatgaag 60 

ataccacctt tcatttctaa gggagaaatt atgaatgaat ggtttcattt tacttgctta 120 

gtttctttct cttcttttat atactttgca tcattaaatg ttgcaaggga aagaggaaaa 180 

tttaagaaac tgatgacagt aatgggtctc cgagagtcag cattctg 227 



<210> 94 

<211> 142 

<212> ADN 

<213> Homo sapiens 

<400> 94

gctctcctgg ngattgacat acatttgctt catcttcatt atgtccattt ttatggctct 60 
ggtcataaca tcaatctcaa ttgtatttca tactggcttc atggtgatat tcacactcta 120 
tagcttatat ggcctttctt tg 142 



<210> 95
<211> 186
<212> ADN 

<213> Homo sapiens 
<400> 95

atagcattgg ctttcctcat gagtgtttta ataaggaaac ctatgctcgc tggtttggct 60 
ggatttctct tcactgtatt ttggggatgt ctgggattca ctgtgttata cagacaactt 120 
cctttatctt tgggatgggt attaagtctt cttagccctt ttgccttcac tgctggaatg 180 
gcccag 186 



<210> 96 
<211> 148 
<212> ADN 

<213> Homo sapiens 
<400> 96

gttacacacc tggataatta cttaagtggt gttatttttc ctgatccctc tggggattca 60 
tacaaaatga tagccacttt tttcattttg gcatttgata ctcttttcta tttgatattc 120 
acattatatt ttgagcgagt tttacctg 148 



<210> 97 
<211> 169 

<212> ADN 

<213> Homo sapiens 
<400> 97

ataaagatgg ccatggggat tctccattat ttttccttaa gtcctcattt tggtccaaac 60 
atcaaaatac tcatcatgaa atctttgaga atgaaataaa tcctgagcat tcctctgatg 120 
attcttttga accggtgtct ccagaattcc atggaaaaga agccataag 169



<210> 98 
<211> 59 
<212> ADN 
<213> Homo sapiens
<400> 98 

aatcagaaat gttataaaag aatataatgg aaagactgga aaagtagaag cattgcaag 59 



<210> 99 

<211> 111

<212> ADN 

<213> Homo sapiens 

<400> 99 

gcatattttt tgacatatat gaaggacaga tcactgcaat acttgggcat aatggagctg 60 
gtaaatcaac actgctaaac attcttagtg gattgtctgt ttctacagaa g 111



<210> 100
<211> 176 

<212> ADN
<213> Homo sapiens 

<400> 100 

gatcagccac tatttataat actcaactct ctgaaataac tgacatggaa gaaattagaa 60 
agaatattgg attttgtcca cagttcaatt ttcaatttga cttcctcact gtgagagaaa 120 
acctcagggt atttgctaaa ataaaaggga ttcagccaaa ggaagtggaa caagag 176 



<210> 101 

<211> 120

<212> ADN 

<213> Homo sapiens

<400> 101 

gtaaaaagaa ttataatgga attagacatg caaagcattc aagacattat tgctaaaaaa 60 
ttaagtggtg ggcagaagag aaaactaaca ctagggattg ccatcttagg agatcctcag 120 



<210> 102
<211> 139 
<212> ADN 
<213> Homo sapiens 

<400> 102 

gttttgctgc tagatgaacc aactgctgga ttggatccct tttcaagaca ccgagtgtgg 60 

agcctcctga aggagcataa agtagaccga cttatcctct tcagtaccca attcatggat 120 
gaggctgaca tcttggctg 139 



<210> 103 

<211> 91 

<212> ADN 

<213> Homo sapiens 



<400> 103 

ataggaaagt atttctgtct aatgggaagt tgaaatgtgc aggatcatct ttgtttctga 60 
agcgaaagtg gggtattgga tatcatttaa g 91 
<210> 104
<211> 140 
<212> ADN 

<213> Homo sapiens

<400> 104 

tttacacagg aatgaaatgt gtgacacaga aaaaatcaca tcccttatta agcagcacat 60 
tcctgatgcc aagttaacaa cagaaagtga agaaaaactt gtatatagtt tgcctttgga 120 
aaaaacgaac aaatttccag 140 



<210> 105 
<211> 120 
<212> ADN 

<213> Homo sapiens 
<400> 105 

atctttacag tgaccttgat aagtgttctg accagggcat aaggaattat gctgtttcag 60 
tgacatctct gaatgaagta ttcttgaacc tagaaggaaa atcagcaatt gatgaaccag 120 



<210> 106 
<211> 199
<212> ADN 

<213> Homo sapiens 
<400> 106

attttgacat tgggaaacaa gagaaaatac atgtgacaag aaatactgga gatgagtctg 60 
aaatggaaca ggttctttgt tctcttcctg aaacaagaaa ggctgtcagt agtgcagctc 120 
tctggagacg acaaatctat gcagtggcaa cacttcgctt cttaaagtta aggcgtgaaa 180 
ggagagctct tttgtgttt 199 



<210> 107 
<211> 167 
<212> ADN 

<213> Homo sapiens 
<400> 107 

gttactagta cttggaattg cttttatccc catcattcta gagaagataa tgtataaagt 60 
aactcgtgaa actcattgtt gggagttttc acccagtatg tatttccttt ctctggaaca 120 
aatcccgaag acgcctctta ccagcctgtt aatcgttaat aatacag 167 



<210> 108

<211> 134

<212> ADN 

<213> Homo sapiens 

<400> 108 

gatcaaatat tgaagacctc gtgcattcac tgaagtgtca ggatatagtt ttggaaatag 60 
atgactttag aaacagaaat ggctcagatg atccctccta caatggagcc atcatagtgt 120 
ctggtgacca gaag 134 



<210> 109 

<211> 138 
<212> ADN 
<213> Homo sapiens
<400> 109

gattacagat tttctgttgc gtgtaatacc aagaaattga attgttttcc tgttcttatg 60 
ggaattgtta gcaatgccct tatgggaatt tttaacttca cggagcttat tcaaacggag 120 
agcacttcat tttctcgt 138 



<210> 110 
<211> 108 
<212> ADN

<213> Homo sapiens 
<400> 110 

gatgacatag tgctggatct tggttttata gatgggtcca tatttttgtt gttgatcaca 60 
aactgcgttt ctccttttat cggcatgagc agcatcagcg attataaa 108 



<210> 111 
<211> 171 
<212> ADN 

<213> Homo sapiens 
<400> 111 

aaaaatgttc aatcccagtt atggatttca ggcctctggc cttcagcata ctggtgtgga 60 
caggctctgg tggacattcc attatacttc ttgattctct tttcaataca tttaatttac 120 
tacttcatat ttctgggatt ccagctttca tgggaactca tgtttgtttt g 171 



<210> 112 
<211> 114

<212> ADN 

<213> Homo sapiens
<400> 112

gtggtatgca taattggttg tgcagtttct cttatattcc tcacatatgt gctttcattc 60 
atctttcgca agtggagaaa aaataatggc ttttggtctt ttggcttttt tatt 114 



<210> 113
<211> 120 
<212> ADN 
<213> Homo sapiens 

<400> 113 

atcttaatat gtgtatccac aattatggta tcaactcaat atgaaaaact caacttaatt 60 
ttgtgcatga ttttcatacc ttccttcact ttgctggggt atgtcatgtt attgatccag 120 



<210> 114 

<211> 81 

<212> ADN 

<213> Homo sapiens 

<400> 114

ctcgacttta tgagaaactt ggacagtctg gacaatagaa taaatgaagt caataaaacc 60 
attcttttaa caaccttaat a 81 
<210> 115
<211> 92 
<212> ADN 

<213> Homo sapiens 
<400> 115 

ccataccttc agagtgttat tttccttttt gtcataaggt gtctggaaat gaagtatgga 60
aatgaaataa tgaataaaga cccagttttc ag 92 



<210> 116 
<211> 121
<212> ADN 

<213> Homo sapiens 
<400> 116

aatctctcca cggagtagag aaactcatcc caatccggaa gagcccgaag aagaagatga 60 
agatgttcaa gctgaaagag tccaagcagc aaatgcactc actgctccaa acttggagga 120 
g 121



<210> 117 
<211> 118 
<212> ADN

<213> Homo sapiens 
<400> 117 

gaaccagtca taactgcaag ctgtttacac aaggaatatt atgagacaaa gaaaagttgc 60 
ttttcaacaa gaaagaagaa aatagccatc agaaatgttt ccttttgtgt taaaaaag 118 



<210> 118 
<211> 92 
<212> ADN 

<213> Homo sapiens 
<400> 118 

gtgaagtttt gggattacta ggacacaatg gagctggtaa aagtacttcc attaaaatga 60 
taactgggtg cacaaagcca actgcaggag tg 92



<210> 119 
<211> 179 
<212> ADN 

<213> Homo sapiens
<400> 119 

gtggtgttac aaggcagcag agcatcagta aggcaacagc atgacaacag cctcaagttc 60 
ttggggtact gccctcagga gaactcactg tggcccaagc ttacaatgaa agagcacttg 120 
gagttgtatg cagctgtgaa aggactgggc aaagaagatg ctgctctcag tatttcacg 179 



<210> 120 
<211> 76 
<212> ADN 

<213> Homo sapiens 
<400> 120

attggtggaa gctcttaagc tccaggaaca acttaaggct cctgtgaaaa ctctatcaga 60 
gggaataaag agaaag 76 



<210> 121 

<211> 95 

<212> ADN 

<213> Homo sapiens 

<400> 121 

ctgtgctttg tgctgagcat cctggggaac ccatcagtgg tgcttctaga tgagccgttc 60 
accgggatgg accccgaggg gcagcagcaa atgtg 95 



<210> 122 
<211> 120 
<212> ADN 

<213> Homo sapiens 
<400> 122 

gcagatactt caggctaccg ttaaaaacaa ggagaggggc accctcttga ccacccatta 60 
catgtcagag gctgaggctg tgtgtgaccg tatggccatg atggtgtcag gaacgctaag 120 

<210> 123

<211> 141

<212> ADN 

<213> Homo sapiens 

<400> 123 

gtgtattggt tccattcaac atctgaaaaa caagtttggt agagattatt tactagaaat 60 

aaaaatgaaa gaacctaccc aggtggaagc tctccacaca gagattttga agcttttccc 120 

acaggctgct tggcaggaaa g 141 



<210> 124 

<211> 80 

<212> ADN 

<213> Homo sapiens 

<400> 124 

atattcctct ttaatggcgt ataagttacc tgtggaggat gtccaccctc tatctcgggc 60 
ctttttcaag ttagaggcga 80 



<210> 125 

<211> 56 

<212> ADN 

<213> Homo sapiens 

<400> 125 

tgaaacagac cttcaacctg gaggaataca gcctctctca ggctaccttg gagcag 56 



<210> 126 

<211> 769 
<212> ADN
<213> Homo sapiens
<400> 126

gtattcttag aactctgtaa agagcaggag ctgggaaatg ttgatgataa aattgataca 60 
acagttgaat ggaaacttct cccacaggaa gacccttaaa atgaagaacc tcctaacatt 120 
caattttagg tcctactaca ttgttagttt ccataattct acaagaatgt ttccttttac 180 
ttcagttaac aaaagaaaac atttaataaa cattcaataa tgattacagt tttcattttt 240 
aaaaatttag gatgaaggaa acaaggaaat atagggaaaa gtagtagaca aaattaacaa 300 
aatcagacat gttattcatc cccaacatgg gtctattttg tgcttaaaaa taatttaaaa 360 
atcatacaat attaggttgg ttttcggtta ttatcaataa agctaacact gagaacattt 420 
tacaaataaa aatatgagtt ttttagcctg aacttcaaat gtatcagcta tttttaaaca 480 
ttatttactc ggattctaat ttaatgtgac attgactata agaaggtctg ataaactgat 540 
gaaatggcac agcataacat ttaattataa tgacattctg attataaaat aaatgcatgt 600 
gaattttagt acatattgaa gttatatgga agaagatagc cataatctgt aagaaagtac 660 
cgcagttaat attttcttta gccaacttat attcaatgta ttttttatgg atcctttttc 720 
aaaggtagta tcagtaggca tagtcatttt ctgtatcttt tcacctcac 769 



<210> 127 
<211> 19
<212> ADN 

<213> Homo sapiens 
<400> 127 

cagtgactat gtatccgtg 19 



<210> 128 
<211> 19 
<212> ADN
<213> Homo sapiens 

<400> 128 

gatggtttct cctcacaac 19 



<210> 129 

<211> 19
<212> ADN 

<213> Homo sapiens
<400> 129 

caccagacaa tgaggatga 19 



<210> 130 

<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 130

gctatattct tcaatggca 19



<210> 131 
<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 131

cctagaagta gaccgcctt 19 



<210> 132 
<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 132 

gttgtgagga gaaaccatc 19 



<210> 133 
<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 133 

ctggatggtt tcagtcaca 19 



<210> 134 
<211> 19 
<212> ADN 

<213> Homo sapiens 

<400> 134 

cagaaaagcc aatcgggtg 19



<210> 135 

<211> 23

<212> ADN 

<213> Homo sapiens 
<400> 135 

ccaggtatat gttgtttaac cag 23 



<210> 136 
<211> 20 
<212> ADN 

<213> Homo sapiens 
<400> 136 

gggtcagatt actgccttac 20



<210> 137 

<211> 20 

<212> ADN 

<213> Homo sapiens 



<400> 137

gaacattgaa gaaccaacac 20
<210> 138
<211> 20 
<212> ADN 

<213> Homo sapiens 
<400> 138 

gtaaggcagt aatctgaccc 20 



<210> 139 
<211> 18
<212> ADN 

<213> Homo sapiens 
<400> 139 

ggaaactgga cagaatgc 18 



<210> 140 
<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 140 

ctaccctatt tcacatgcc 19 



<210> 141 

<211> 20

<212> ADN 

<213> Homo sapiens 
<400> 141 

gtttctccca taataacagc 20 



<210> 142

<211> 20
<212> ADN 

<213> Homo sapiens 
<400> 142 

gctgttatta tgggagaaac 20 



<210> 143 
<211> 30 
<212> ADN 

<213> Homo sapiens 

<400> 143 

agactacagt aacaaaagcc tagtgcagcc 30 



<210> 144 
<211> 30 
<212> ADN 
<213> Homo sapiens
<400> 144

atccaatcct attagtgtga caaaggcttg 30 



<210> 145 
<211> 20 
<212> ADN 

<213> Homo sapiens 
<400> 145 

tcagcaaacc aaagcacttc 20 



<210> 146 

<211> 27

<212> ADN 

<213> Homo sapiens

<400> 146 

caagtgctgt tttattcatt atctgct 27 



<210> 147 

<211> 28
<212> ADN 

<213> Homo sapiens 
<400> 147

gtacatgaaa actcaccata tccatccc 28 



<210> 148 

<211> 20 

<212> ADN 

<213> Homo sapiens 

<400> 148 

tcattgctgg gatggatatg 20 



<210> 149 
<211> 19
<212> ADN
<213> Homo sapiens 

<400> 149 

ccctgtgatg gaggagttg 19 



<210> 150 

<211> 21 

<212> ADN 

<213> Homo sapiens 



<400> 150 

tgacatcaac tcctccatca c 21
<210> 151

<211> 20 

<212> ADN 

<213> Homo sapiens 

<400> 151 

gaatgctgaa tcttggagac 20



<210> 152 

<211> 20

<212> ADN 

<213> Homo sapiens 

<400> 152 

gattcagatt atcaaactgg 20 



<210> 153 

<211> 21 

<212> ADN 

<213> Homo sapiens 

<400> 153 

tggtgtaatt tttcctgacc c 21 



<210> 154 
<211> 22 

<212> ADN 

<213> Homo sapiens
<400> 154 

aagggtcagg aaaaattaca cc 22 



<210> 155 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 155 

ggaattcagg agctactgg 19 



<210> 156 

<211> 22 

<212> ADN 

<213> Homo sapiens 

<400> 156

gattgtctgt tccaacagaa gg 22



<210> 157 
<211> 22
<212> ADN

<213> Homo sapiens 



<400> 157 

ccacttcctt tagatgaatc cc 22



<210> 158 
<211> 22 
<212> ADN 



<213> Homo sapiens 



<400> 158 

aagtggaaca agaggtacaa cg 22



<210> 159 
<211> 21 

<212> ADN 

<213> Homo sapiens 

<400> 159 

atggtaatcc caaaagtcag c 21 



<210> 160

<211> 22
<212> ADN 

<213> Homo sapiens 
<400> 160

ggggatgtga tgagtaatga ag 22 



<210> 161 
<211> 22
<212> ADN
<213> Homo sapiens 

<400> 161 

cttcattact catcacatcc cc 22 



<210> 162 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 162 

acaacttccc caggaaccc 19 



<210> 163 

<211> 19 

<212> ADN 

<213> Homo sapiens 



<400> 163 

gatcaacagg ctggtacgg 19



<210> 164 
<211> 22
<212> ADN
<213> Homo sapiens

<400> 164

caagaaaaat gctaagtccc ag 22



<210> 165 

<211> 19 

<212> ADN

<213> Homo sapiens 

<400> 165 

tgcccacacG agtaagcag 19 

<210> 166

<211> 22 

<212> ADN 

<213> Homo sapiens 

<400> 166 

gaaaatcagt ggcactcaat tc 22 

<210> 167
<211> 22 
<212> ADN 
<213> Homo sapiens 

<400> 167 

tgccactgat tttctagtct gc 22



<210> 168 
<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 168

ctgggatcac aaagccaac 19 



<210> 169 

<211> 20 

<212> ADN 

<213> Homo sapiens 

<400> 169 

cctttcagtt ccacctctcc 20 



<210> 170 
<211> 21

<212> ADN 

<213> Homo sapiens 

<400> 170 

tccacactga gattctgaag c 21 



<210> 171 
<211> 19 
<212> ADN 

<213> Homo sapiens
<400> 171 

aatacctttc ctgccctgc 19 



<210> 172 
<211> 19 
<212> ADN 

<213> Homo sapiens 
<400> 172 

gcctgactct ttgggtgac 19 



<210> 173 

<211> 19

<212> ADN

<213> Homo sapiens
<400> 173 

tgagcgtggg tcagcaaac 19 



<210> 174 
<211> 20
<212> ADN

<213> Homo sapiens
<400> 174 

gcaactcctc cttgggcaac 20 



<210> 175 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 175 

tttgttgccc aaggaggag 19 



<210> 176 

<211> 22 

<212> ADN 

<213> Homo sapiens 
<400> 176

ggaaaaacaa gggagaacat cg 22



<210> 177 

<211> 19

<212> ADN

<213> Homo sapiens 

<400> 177 

gcccacttgg attcttcac 19 



<210> 178 
<211> 22 
<212> ADN 

<213> Homo sapiens 
<400> 178 

ccacaccttt caaagcttct ac 22 



<210> 179 
<211> 22 
<212> ADN 

<213> Homo sapiens 
<400> 179 

atgtggtcct tgagaatgaa ac 22



<210> 180 
<211> 22 
<212> ADN 

<213> Homo sapiens 
<400> 180 

actgtgaaag aaaacctcag gc 22



<210> 181 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 181 

cttcatgtgg caaaatccc 19 



<210> 182 
<211> 20 
<212> ADN 

<213> Homo sapiens 



<400> 182 

tgtgctgtca atttggcatc 20



<210> 183

<211> 20

<212> ADN

<213> Homo sapiens

<400> 183 

aagaagaaat ggggcatagg 20 



<210> 184 

<211> 22 

<212> ADN

<213> Homo sapiens 

<400> 184 

tgtatttgga gacagttccc ac 22 



<210> 185 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 185 

aacaatcagt ggcgtggcg 19 



<210> 186 

<211> 22 

<212> ADN 

<213> Homo sapiens 

<400> 186 

gacatccagg aggacaggaa ag 22 



<210> 187 

<211> 21 

<212> ADN 

<213> Homo sapiens 

<400> 187 

gcagcctctt tcactccata c 21 



<210> 188 

<211> 22 

<212> ADN 

<213> Homo sapiens 

<400> 188 

cattgtgtca ggtgatgaaa ag 22 

<210> 189 

<211> 20 

<212> ADN 

<213> Homo sapiens 
<400> 189

ttcatttcta ggcatcgcag 20 



<210> 190 

<211> 22

<212> ADN 

<213> Homo sapiens 

<400> 190 

cattagcagg aggatcaaaa ag 22 



<210> 191 

<211> 21 

<212> ADN 

<213> Homo sapiens

<400> 191

tctagggcta ttttttggca c 21



<210> 192 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 192 

cgctcccttt caaaatcac 19 



<210> 193 

<211> 21 

<212> ADN 

<213> Homo sapiens 

<400> 193 

tgcgagactt tgatgagaca c 21 



<210> 194 

<211> 20 

<212> ADN 

<213> Homo sapiens 

<400> 194 

agaccatcag ggaggagaac 20 



<210> 195 

<211> 18 

<212> ADN 

<213> Homo sapiens 



<400> 195 

tgtgccagca accaaatc 18



<210> 196
<211> 22 
<212> ADN 

<213> Homo sapiens 
<400> 196 

gctggagatg aagctgaaga ac 22 



<210> 197 

<211> 18

<212> ADN 

<213> Homo sapiens 

<400> 197 

tttccacttc accgaggg 18 



<210> 198 
<211> 21 
<212> ADN 

<213> Homo sapiens
<400> 198

ccatgttttg tctgttgtgc c 21



<210> 199 

<211> 22 

<212> ADN 

<213> Homo sapiens 

<400> 199 

cacccatcaa cccatcatct ac 22 



<210> 200 
<211> 22 
<212> ADN

<213> Homo sapiens 
<400> 200 

aggcacaaca gacaaaacat gg 22 



<210> 201 
<211> 22
<212> ADN 

<213> Homo sapiens 
<400> 201 

aagcatgatg tagtagtgac cc 22 



<210> 202 

<211> 25
<212> ADN 






<213> Homo sapiens 



<400> 202 

cttgggtagt tttggattca ggtgc 25



<210> 203 
<211> 27 

<212> ADN 



<213> Homo sapiens 



<400> 203 

agatccattg aagacatttg aggagtg 27



<210> 204
<211> 20 
<212> ADN 



<213> Homo sapiens 



<400> 204

gattgacata catttgcttc 20



<210> 205 

<211> 20 

<212> ADN 

<213> Homo sapiens 

<400> 205 

tacagtgaag agaaatccag 20



<210> 206

<211> 18

<212> ADN 

<213> Homo sapiens 

<400> 206 

tggaattaga catgcaaa 18 

<210> 207 
<211> 19
<212> ADN

<213> Homo sapiens
<400> 207

tgaagaggat aagtcggtc 19 

<210> 208 

<211> 18 

<212> ADN 

<213> Homo sapiens 



<400> 208 

tataatcgct gatgctgc 18



<210> 209

<211> 26 

<212> ADN 

<213> Homo sapiens 

<400> 209 

accaggccag agtcattaaa ctgatc 26 



<210> 210 

<211> 25 

<212> ADN 

<213> Homo sapiens 

<400> 210 

ccgaaaagat gcacaaatat agccc 25



<210> 211 

<211> 26

<212> ADN

<213> Homo sapiens

<400> 211

ctcaaaactt cattctaatt gtgccc 26 



<210> 212 
<211> 18 
<212> ADN
<213> Homo sapiens 

<400> 212 

agataagcgt gcgtcaac 18 



<210> 213
<211> 20
<212> ADN 
<213> Homo sapiens 

<400> 213 

tcttatggga attgttagca 20 



<210> 214 

<211> 19 

<212> ADN 

<213> Homo sapiens 

<400> 214 

ttatgactgg ttcctcctc 19 



<210> 215 
<211> 18 
<212> ADN

<213> Homo sapiens 

<400> 215 

tcatcaacat ttcccagc 18 



<210> 216 

<211> 21 
<212> ADN

<213> Homo sapiens 
<400> 216 

gaaatactgg agatgagtct g 21 



<210> 217 
<211> 19 
<212> ADN 

<213> Homo sapiens 



<400> 217 

gagcttaaga gcttccacc 19



